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NUCLEAR FERTILITY RESTORER GENES AND 
METHODS OF USE IN PLANTS 

CROSS-REFERENCE TO RELATED APPLICATIONS 

This application claims the priority benefit of U.S. Provisional Patent Application 
Serial No. 60/305,026 filed July 12, 2001, U.S. Provisional Patent Application Serial No. 
60/305,363 filed July 13, 2001, and U.S. Provisional Patent Application Serial No. 
60/308,736 filed July 30, 2001, the entire contents of which are hereby incorporated by 
reference. 

BACKGROUND OF THE INVENTION 
Field of the Invention 

[0001] This invention relates generally to nucleic acid sequences encoding proteins that 
restore fertility in a plant. 
Background Art 

[0002] Considerable effort is being devoted to the development of strategies to increase 
canola yields due to the importance of canola crops worldwide. One method of obtaining 
increased yields of canola involves the generation of hybrid canola plants. Due to a 
phenomenon termed "hybrid vigor", hybrid canola plants are higher yielding than non-hybrid 
canola plants (Grant, I. and Beversdorf, W., 1985, Can. J. Genet. Cytol. 27:472-478). In fact, 
manually produced hybrids can yield up to 50% more seed than either of their parental lines. 
To produce such hybrids on a large scale, however, self-pollination of at least one of the 
parents of the hybrid cross must be prevented. One means of preventing self-pollination is to 
incorporate the trait of cytoplasmic male sterility into a seed parent of the hybrid. 

[0003] Cytoplasmic male sterility (CMS) results in an inability of the plant to produce 
viable pollen. In some cases, pollen formation is blocked or aborted in a CMS plant because 
of a gene in a cytoplasmic organelle, the mitochondrion. This widespread and classic non- 
Mendelian trait results from rearrangements of the mitochondrial genome (Schnable, P.S. and 
Wise, R.P., 1998, Trends in Plant ScL, 3:175-180). Plants carrying the CMS trait are 
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incapable of self-pollination, and therefore, when a CMS line is planted alongside a male- 
fertile line, all the seed that forms on the sterile plants is a hybrid of the two parents. 
[0004] importantly however, use of the CMS trait in a hybridization scheme produces 
seeds that are male-sterile since, in most species, the trait is inherited maternally. While the 
fertility of the resultant seeds is unimportant in some crops (i.e., vegetables), fertility must be 
restored in the crops for which pollen production is required for formation of the harvested 
products, as in the case of fruit or seed crops such as canola. In order to restore fertility to the 
hybrids, specific dominant nuclear genes termed restorers of fertility (Rf) can be introduced 
into the hybrid plants to suppress the male-sterile phenotype (Schnable, P.S. and Wise, R.P., 

1998, Trends in Plant Sci., 3:175-180). Accordingly, the use of CMS for commercial seed 
production involves the use of three breeding lines, a male-sterile line (female parent), a 
maintainer line which is isogenic to the male-sterile line but does not contain a sterility 
inducing mitochondrial genome and a restorer line (male parent), 

[0005] A crop of particular interest herein is the oilseed crop of the species Brassica 
napus 9 commonly referred to as canola. A number of CMS systems have been reported in 
Brassica species. Five of the systems most commonly used for hybrid seed production are 
Polima (pol), nap, tournefortii, Kosena and Ogura (ogu). The form of CMS in Brassica 
napus which is currently thought to be potentially the most useful for hybrid seed production 
is the ogu system. The ogu system is based on the use of a hybrid cytoplasm in which the 
male sterility determinant is derived from a radish (Raphanus sativum) cytoplasm. Male 
sterility induced by ogu cytoplasm is more complete and more temperature stable than any of 
the other endogenous B. napus CMS systems. Analysis of the ogu mitochondrial genome has 
indicated that this form of CMS is specified by a novel open reading frame (QRF), orfl38, 
that encodes a polypeptide, ORF138 (Grelon et al., 1994, Mol. Gen. Genet. 243:540-547). 

[0006] Recently, a Brassica napus restorer line for the ogu system became available 
(Delourme, R. et al., 1995, Proc. 9 th Int. Rapseed Cong. Cambridge, UK 1:6-8). Using this 
restorer line, it was determined that restoration of fertility resulted in a decrease of the 
ORF138 protein in stamens as compared to un-restored, ogu sterile lines (Bellaui, M. et al., 

1999, Plant Mol. BioL 40:893-902). However, a drawback to these prior art ogu restorer 
lines is that hybrids produced using these lines have elevated glucosinolate levels. An 
elevation of glucosinolate levels in plants is problematic when the plants are used in animal 
feed because this compound causes digestive problems in animals. Elevated glucosinolate 
levels are undesirable in canola plants in particular since much of their value is derived from . 
their low levels of glucosinolate compounds. 
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[0007] The elevation of glucosinolate levels results from a dominant gene that is linked to 
the radish nuclear fertility restorer gene or genes, termed Rfo in the prior art. Rfo, like the ogu 
cytoplasm, has been introduced from the radish but recombination in the radish chromosomal 
region surrounding Rfo is suppressed in B. napus (Delounne R. et al., 1998, Theor. Appl. 
Genet. 97:129-134). Despite considerable effort by several groups, it has not yet been 
possible to develop stable B. napus lines in which Rfo has been efficiently dissociated from 
the glucosinolate gene and do not address other deficiencies in Rfo restorer lines, and 
therefore, the system is not widely implemented. 

[0008] Accordingly, what are needed in the art are improved lines of canola that can be 
used as restorers of fertility in hybridization systems. More particularly, it would be 
beneficial to provide restorer lines of canola containing one or more nuclear fertility restorer 
genes from Raphanus sativum, which genes are separated from the gene or genes causing 
increased levels of glucosinolate in the resultant hybrid plants. The present invention also 
provides a method of using the fertility restorer nucleic acid to select for transgenic plant 
cells by means of its capacity to restore pollen production to cytoplasmic male sterile plants. 

SUMMARY OF THE INVENTION 

[0009] This invention fulfills in part the need to isolate a nuclear fertility restorer locus 
and genes and provide improved restorer lines for plants, and canola in particular. The 
present invention provides a Rfo restorer region that contains a genus of isolated nuclear 
fertility restorer genes. In one embodiment, the nuclear fertility restorer genus is derived 
from a radish, comprises a pentatricopeptide (PPR) motif and is able to restore fertility in a 
male-sterile plant. In another preferred embodiment, the nuclear fertility restorer genus is 
lacking genes associated with increased glucosinolate traits. In a preferred embodiment, the 
male-sterile plant comprises the ogu male sterility determinant and the nuclear fertility 
restorer gene is derived from Raphanus sativum. In a more preferred embodiment, the male- 
sterile plant is Brassica napus. 

[0010] The present invention provides a nuclear fertility restorer genus as shown in SEQ 
ID NO: 87. In a preferred embodiment, the present invention provides a nuclear fertility 
restorer genus, located within Genes 14 through 30, as shown between positions 88,073 and 
198,041 of SEQ ID NO:87. hi a preferred embodiment, the nuclear fertility restorer gene is 
selected from Genes 15, 16, 17, 21, 22, 24, 26 and 27, as defined herein. In a preferred 
embodiment, the nuclear fertility restorer gene is a nucleotide sequence selected from SEQ 
ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:48 
SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. In a preferred embodiment, the nuclear 
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fertility restorer gene encodes a protein comprising an amino acid sequence of Gene product 
15, 16, 17, 21, 22, 24, 26 and 27, as defined herein. In a preferred embodiment, the nuclear 
fertility restorer gene encodes a protein comprising an amino acid sequence selected from 
SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:41, SEQ ID NO:43, SEQ ID 
NO:47, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. In a preferred embodiment, the 
nuclear fertility restorer gene comprises a nucleotide sequence of Gene 16 as shown in SEQ 
ID NO:32 or Gene 26 as shown in SEQ ID NO:89. In a preferred embodiment, the nuclear 
fertility restorer gene encodes a protein comprising an amino acid sequence as shown in SEQ 
ID NO:31 or SEQ ID NO:88. 

[0011] The invention further provides an isolated plant transformation vector comprising 
a nuclear fertility restorer gene as described below, wherein expression of the vector in a host 
plant results in the plant's increased production of viable pollen. In a preferred embodiment, 
the host cells are located in a plant stamen, or more particularly, a plant anther. 
[0012] The present invention also provides plant cells, plant parts, plant seeds and plants 
comprising the nuclear fertility restorer genes, proteins and vectors described herein. In one 
embodiment, a plant seed according to the present invention comprises a nuclear fertility 
restorer nucleic acid, and accordingly, the plant seed is true breeding for the ability to restore 
fertility in a male-sterile plant. The invention further provides an agricultural product 
produced by any of the below-described plants, plant parts or plant seeds. 
[0013] The invention additionally provides a method of producing a hybrid plant 
comprising crossing a male-sterile plant with a restorer plant, wherein the restorer plant 
contains a nuclear fertility restorer nucleic acid described herein. The present invention also 
provides a method of restoring male fertility in a plant comprising introducing a nuclear 
fertility restorer nucleic acid into a male-sterile plant. The present invention also provides a 
method of increasing the production of viable pollen in a plant, including introducing a 
nuclear fertility restorer nucleic acid into a plant. The present invention also provides 
methods of using genetic markers from the sequences described herein to determine the 
presence of a nuclear fertility restorer genus in a plant. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1. Flowers of fl/o-fertility restored (A) and Ogura (ogu) cytoplasmic male sterile radish 
(Raphanus sativum) (B). Panel C allows direct comparison of fertility restored (left) and ogu 
CMS floral morphology. 
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Fig. 2. Genetic mapping of the radish Rfo locus with respect to B. napus (black lettering) and 
Arabidopsis BAC derived probes (underlined). Genetic distance is indicated in centimorgans 
(cM). 

Fig. 3 A. Representation of radish genetic maps in the Rfo region based on genetic mapping of 
Arabidopsis BAC-derived probes (map positions are indicated by dots on the solid lines; only 
two of the mapped makers, LI 2 and L40, are labeled). The lower bar represents the position 
of the same markers on the Physical map of the Arabidopsis genome. The lines between the 
two bars indicate the relationship between the physical position of the markers in Arabidopsis 
and the genetic position in radish. Note the duplication (rBl and rB2) and inversion (rB2) of 
Arabidopsis region B in the radish genetic map. Fig. 3B. Proposed physical relationship 
between the Arabidopsis and radish genomes in the Rfo region. The inversion of markers in 
the rB2 region of radish with respect to Arabidopsis suggest that the Rfo gene can be 
approached by employing Arabidopsis-derived markers from region A moving from positions 
cto a. 

Fig. 4. BAC and cosmid contig spanning the Rfo containing region of the radish genome. 
The horizontal bars represent different the region contained in different clones; BAC clones 
are indicated by the shaded bars. The symbols on the bars (triangles, squares, circles etc.) 
indicate the position of different markers used in radish genetic mapping or clone recovery 
experiments. 

Fig. 5. Schematic representation of the annotation for the portion of the 270 kb radish 
genomic sequence containing the Rfo locus. The sizes and locations of different predicted 
genes within the region are indicated by the smaller arrows below the long arrow that depicts 
the corresponding numerical coordinates of the sequence. The arrows indicate the direction 
of transcription; for genes indicated by arrows pointing to the right, the sense strand sequence 
is given in the Appendix; for arrows pointing to the left, the sense strand sequence will be the 
reverse complement of that given in the Appendix. Sequences and genes contained in 
different pRD400 (solid lines) and pOCA18 (dashed lines) clones used for transformation are 
indicated beneath the annotation schematic. 

Figs. 6A and 6B. Flowers of the Brassica napus Ogura CMS line used in transformation 
experiments (6A) and a TO transgenic plant transformed with the contruct P2-Sh31, 
containing Gene 16 (6B). Note the poorly developed stamens and anthers in the CMS line 
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and the larger, normal, fertile stamens and anthers in the transgenic Gene 16 fertility restored 
plant. Note too the yellow petal color of A napus flowers in comparison to the white petals 
of radish flowers. Flowers of the fertile plant recovered from transformation with the Bgl-5 
construct containing Gene 26 are identical in appearance to those of the Gene 16 
transformants. 

Figs. 7A and 7B. Left Panel: RT-PCR products generated using an oligo dT adapter primer 
and a primer internal to the Gene 16 open reading frame (3' RACE) from total floral RNA of 
restored fertile (lane 4) and ogu CMS (lane 5) radish plants. Lane 1: DNA size markers (1Kb 
DNA ladder). Lanes 2 and 3 are RT-PCR products from controls provided with the 3' RACE 
kit. Lane 6 is gene-specific positive control. Right panel: 5 9 RACE products generated from 
total RNA of restored fertile (lane 4) and ogu CMS radish (lane 5) plants using a primer 
internal to the Gene 16 open reading frame and an adaptor primer for the 5' end. Lanes 1, 2, 
3 and 6 are the equivalents of those in the left panel. 

Fig. 8. Schematic depicting the structure of Gene 16 mRNA. Exons are indicated by boxes 
and intron positions and sizes by the diagonal lines connecting the exon boxes. Positions of 
the start (ATG), stop (TAA) and polyadenylation sites are numbered with reference to the 
Sequence shown in SEQ ID NO:87. 

Fig. 9 Structure of the protein encoded by Gene 16 (Gene 16p), as deduced from the sequence 
of the full length cDNA. 

Fig. 10. Comparisons of the proteins encoded by Gene 16 (Gene 16p) and Gene 15 (Gene 
15p). Identical amino acids are indicated by dark shading. Sites at which an amino acid is 
replaced by a similar but non-identical amino acid are indicated by lighter shading. 

Fig. 11.5' RACE products generated from total RNA of restored fertile (lane 2) radish plants 
using a primer internal to the Gene 26 open reading frame and an adaptor primer for the 5' 
end. Lane 3 is a gene-specific positive control. RT-PCR products generated using an oligo 
dT primer and a primer internal to the Gene 26 open reading frame (3' RACE) from total 
floral RNA of restored fertile (lane 5) radish plants. Lanes 1 & '8: DNA size markers (1Kb 
DNA ladder). Lane 6: an RT-PCR product for a control provided with the 5' RACE kit. 
Lanes 4 and 7 demonstrate that no product is generated when the adaptor primer for the 5'ehd 
or the oligo dT adaptor primer was omitted from the reactions respectively. 
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Fig, 12. Schematic depicting the structure of Gene 26 mRNA. Translation start and stop 
sites and polyadenylation site are as indicated in Fig. 8. 

Fig. 13. Structural features of the protein encoded by Gene 26 (Gene 26p). The predicted N 
terminal mitochondrial targeting presequence is enclosed in the open boxes. Shaded regions 
indicate copies of the PPR domain repeats. 

DETAILED DESCRIPTION OF THE INVENTION 

[0014] The present invention may be understood more readily by reference to the 
following detailed description of the preferred embodiments of the invention and the 
Examples included herein. However, before the present compounds, compositions, and 
methods are disclosed and described, it is to be understood that this invention is not limited to 
specific nucleic acids, specific polypeptides, specific cell types, specific host cells, specific 
conditions, or specific methods, etc., as such may, of course, vary, and the numerous 
modifications and variations therein will be apparent to those skilled in the art. It is also to be 
understood that the terminology used herein is for the purpose of describing specific 
embodiments only and is not intended to be limiting. 

[0015] This invention fulfills in part the need to isolate a nuclear fertility restorer locus 
and genes and provide improved restorer lines for plants, and canola in particular. The 
present invention provides a Rfo restorer region that contains a genus of isolated nuclear 
fertility restorer genes. In one embodiment, the nuclear fertility restorer genus is derived 
from a radish, comprises pentatricopeptide (PPR) motifs and is able to restore fertility in a 
male-sterile plant. In another preferred embodiment, the nuclear fertility restorer genus is 
lacking genes associated with increased glucosinolate traits. In a preferred embodiment, the 
male-sterile plant comprises the ogu male sterility determinant and the nuclear fertility 
restorer gene is derived from Raphanus sativum. In a more preferred embodiment, the male- 
sterile plant is Brassica napus. 

[0016] The present invention provides a nuclear fertility restorer genus as shown in SEQ 
ID NO:87. In a preferred embodiment, the present invention provides a nuclear fertility 
restorer genus, located within Genes 14 through 30 as defined herein, as shown between 
positions 88,073 and 198,041 of SEQ ED NO:87. In a preferred embodiment, the nuclear 
fertility restorer gene is selected from Genes 15, 16, 17, 21, 22, 24, 26 and 27, as defined 
herein. In a preferred embodiment, the nuclear fertility restorer gene is a nucleotide sequence 
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selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ ID 
NO:44, SEQ ID NO:48, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. In a preferred 
embodiment, the nuclear fertility restorer gene encodes a protein comprising an amino acid 
sequence of Gene product 15, 16, 17, 21, 22, 24, 26 and 27, as defined herein. In a preferred 
embodiment, the nuclear fertility restorer gene encodes a protein comprising an amino acid 
sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:41, 
SEQ ID NO:43, SEQ ID NO:47, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. In a 
preferred embodiment, the nuclear fertility restorer gene encodes a protein comprising an 
amino acid sequence as shown in SEQ ID NO:3 1 or SEQ ID NO:88. 
[0017] The invention further provides an isolated plant transformation vector comprising 
a nuclear fertility restorer gene as described below, wherein expression of the vector in a host 
plant results in the plant's increased production of viable pollen. In a preferred embodiment, 
the host cells are located in a plant stamen, or more particularly, a plant anther. 
[0018] The present invention also provides plant cells, plant parts, plant seeds and plants 
comprising the nuclear fertility restorer genes, proteins and vectors described herein. In one 
embodiment, a plant seed according to the present invention comprises a nuclear fertility 
restorer nucleic acid, and accordingly, the plant seed is true breeding for the ability to restore 
fertility in a male-sterile plant. The invention further provides an agricultural product 
produced by any of the below-described plants, plant parts or plant seeds. 
[0019] The invention additionally provides a method of producing a hybrid plant 
comprising crossing a male-sterile plant with a restorer plant, wherein the restorer plant 
contains a nuclear fertility restorer nucleic acid described herein. The present invention also 
provides a method of restoring male fertility in a plant comprising introducing a nuclear 
fertility restorer nucleic acid into a male-sterile plant. The present invention also provides a 
method of increasing the production of viable pollen in a plant, including introducing a 
nuclear fertility restorer nucleic acid into a plant. The present invention also provides 
methods of using genetic markers from the sequences described herein to determine the 
presence of a nuclear fertility restorer genus in a plant. 

[0020] A novel discovery described herein comprises the identification of the nucleic acid 
sequence that encodes the Rfo genetic locus in Raphanus sativum associated with restoration 
of fertility in male-sterile plants. It is to be understood that the Bfo genetic locus includes one 
or more introns, one or more exons, or a combination thereof. 

[0021] The present invention provides in a preferred embodiment specific genes from the 
genomic DNA of a radish Ogura restorer line. Each of these genes can, individually, 
completely restore male fertility to CMS lines carrying the Ogura cytoplasm. The genes each 
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encode apparently unrelated proteins. The availability of the isolated genes makes it possible 
to produce a restorer line by introducing the isolated gene or genes into B. napus plants for 
example by plant transformation. The resulting plants carry reduced agronomic deficiencies 
associated with the presence of Rfo-region radish DNA in B. napus, including the gene that 
elevates seed glucosinolate content. This method for producing B. napus restorer lines for 
ogu CMS is faster and less costly than any other currently known practice. 
[0022] While the genes of the present invention that restore ogu CMS bear no 
resemblance to the other restorer genes that have been characterized at the DNA or protein 
sequence levels, the invention provides that nuclear restorer genes for other CMS systems in 
other crops can be homologs of the radish Rfo genes. Thus, the knowledge of the molecular 
identity of the Rfo genes presented herein allows the facile isolation of nuclear restorers for 
many other crop species in which CMS is employed in hybrid production, such as sunflower 
and rice. 

[00231 The examples herein provide evidence that multiple specific genes within the 
sequence listed in the Appendix as SEQ ID NO: 87 can function as fertility restorer genes. 
Specifically, Gene 16 (SEQ ID NO:32) and Gene 26 (SEQ ID NO:89) are demonstrated to 
restore fertility in a CMS system. The invention provides that other genes within the region 
can also function as restorer genes. For example, two partially fertile plants were obtained 
after transformation with the Gene 15 (SEQ ID NO:30) construct. Thus Gene 15, like Gene 
1 6, can function as a restorer gene. Similarly, two partially fertile plants were recovered after 
transformation with the Gene 17 (SEQ ID NO:34) construct. A plant transformed with a 
construct containing Gene 21 (SEQ ID NO:42) and Gene 22 (SEQ ID NO:44) also produced 
some fertile flowers. Gene 24 (SEQ ID NO:48) and Gene 27 (SEQ ID NO:54) also contain 
PPR domains and are therefore expected to restore fertility phenotype according to the 
present invention. As discussed above, the preliminary analysis indicated that the restorer 
functions are most preferably located between Gene 14 and Gene 30 within the Rfo region as 
shown in the Appendix between positions 88,073 and 198,041 of SEQ ID NO:87. 
[0024] There are several reasons why not all of the transfonnants recovered following 
transformation with a specific construct necessarily show the same phenotype. The 
expression of the genes encoded in a specific construct may vary depending on the site of 
insertion, the number of copies of the gene at each insertion site, and other factors such as 
transgene silencing. Therefore, it is apparent that other genes in the genetically defined Rfo 
containing region will likely be found by routine analysis in view of the present disclosure to 
function as genes that confer complete fertility restoration. Given that Rfo maps as a single 
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genetic locus in radish, the invention provides that two or more different genes in the region 
are able to function as restorer genes for Ogura CMS in B. napus. 

[0025] The sequences given in the Appendix have utility in the development of effective, 
glucosinolate-free or glucosinolate-reduced restorer lines through approaches other than the 
transgenic approach illustrated above. In one envisioned example of this embodiment, any 
portion of the sequence can be used to design DNA markers that allow the radish Rfo region 
to be specifically detected in plant breeding experiments aimed a separating the high seed 
glucosinolate trait from the restorer gene. Such markers allow the identification of individual 
plants that contain the restorer gene region but have a minimal amount of associated radish 
DNA. This in turn aids significantly in the development of alternative low glucosinolate 
Ogura restorer lines. 

[0026] As mentioned above, the present invention provides a genus of isolated Rfo genes 
and proteins encoded thereby. Some members include a pentatricopeptide (PPR) motif, or 
consensus sequence, and are able to restore fertility in a male-sterile plant. As used herein, 
'TPR motif ' includes the following amino acid consensus sequence: 
VTYNTLISGYCKAGKLEEALELFKEMKEKGIKPDV (SEQ ID NO:90). A Rfo protein is 
defined herein as a protein whose amino acid sequence has one or more 35 amino acid 
regions having greater than 50% homology with a PPR motif or consensus sequence, and 
preferably comprises at least the single methionine and the tyrosine at position three of the 
motif. As also used herein, the term "male-sterile plant" refers to a plant wherein the 
function of the male organs is disrupted or blocked, or more particularly, wherein the male 
organs are unable to produce viable pollen. The term "male-sterile" includes genie male 
sterility and cytoplasmic-genetic male sterility. The terms "cytoplasmic-genetic male sterile" 
and "cytoplasmic male sterile" are used interchangeably herein. In a preferred embodiment, 
the nucleic acids and proteins of the present invention are used to restore fertility in a 
cytoplasmic male-sterile, or CMS, plant. In a further preferred embodiment, the cytoplasmic 
male-sterile plant comprises a cytoplasmic male sterility determinant selected from the group 
consisting of pol 9 nap, Tournefortii, Kosena and ogu. In a most preferred embodiment, the 
cytoplasmic male sterility determinant is ogu. 

[0027] The male-sterile plants of the present invention can be selected from maize, wheat, 
rye, oat, triticale, rice, barley, soybean, peanut, rapeseed, canola, pepper, sunflower, Vicia 
species, pea, alfalfa, bushy plants (coffee, cacao, tea), Salix species, radish, sorghum, pearl 
millet, cotton, and tobacco. It is preferable however that the male-sterile plant is a canola 
plant selected from the group of Brassica species consisting of Brassica napus, Brassica rapa 
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(or campestris), Brassica oleracea, Brassica nigra, Brassica juncea, Sinapis alba, and 
Brassica carinata. In a more preferred embodiment, the male-sterile plant is Brassica napus. 
[0028] The present invention encompasses a Rfo gene derived from a species of radish, 
including but not limited to, Raphanus sativum. In one embodiment of the present invention, 
the Rfo gene is derived from Raphanus sativum. Accordingly, the present invention provides 
a Rfo gene that, upon its introduction into a male-sterile plant, is able to increase the plant's 
production of pollen and/or restore the fertility of the plant. 

[0029] The invention further includes isolated amino acid sequences encoded by the Rfo 
genes provided herein. In a preferred embodiment, the nuclear fertility restorer protein 
(hereinafter **Rf protein" or "Rf amino acid sequence") comprises an amino acid sequence as 
shown herein. The present invention also includes homologs, orthologs and paralogs of the 
amino acid sequences shown herein. Homologs, orthologs and paralogs are further defined 
below. 

[0030] The terms "gene", "nucleic acid" and "nucleic acid molecule" are used 
interchangeably herein and are intended to include DNA molecules (e.g., cDNA or genomic 
DNA) and RNA molecules (e.g., xnRNA) and analogs of the DNA or KNA generated using 
nucleotide analogs. This term also encompasses untranslated sequence located at both the 3' 
and 5* ends of the coding region of the gene: up to at least about 1000 nucleotides of 
sequence upstream from the 5' end of the coding region and up to at least about 200 
nucleotides of sequence downstream from the 3' end of the coding region of the gene. The 
nucleic acid molecule can be single-stranded or double-stranded, but preferably is double- 
stranded DNA. 

[0031] An "isolated" nucleic acid molecule is one that is substantially separated from 
other nucleic acid molecules that are present in the natural source of the nucleic acid (i.e., 
sequences encoding other proteins). Preferably, an "isolated" nucleic acid is free of some of 
the sequences that naturally flank the nucleic acid (i.e., sequences located at the 5' and 3' 
ends of the nucleic acid) in its naturally occurring replicon. For example, a cloned nucleic 
acid is considered isolated. In one embodiment of the present invention, a Rfo nucleic acid is 
isolated when it is separated from all or part of the glucosinolate gene, for example in 
Raphanus sativum. In other various embodiments, the isolated Rfo nucleic acid molecule can 
contain less than about 5 kb, 4 kb, 3 kb, 2 kb, 1 kb, 0.5 kb or 0.1 kb of nucleotide sequences 
which naturally flank the nucleic acid molecule in genomic DNA of the cell from which the 
nucleic acid is derived (e.g., a Raphanus sativum cell). A nucleic acid is also considered 
isolated if it has been altered by human intervention, or placed in a locus or location that is 
not its natural site, or if it is introduced into a cell by Agrobacterium-mediated 
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transformation. Moreover, an isolated" nucleic acid molecule, such as a cDNA molecule, 
can be free from some of the other cellular material with which it is naturally associated, or 
culture medium when produced by recombinant techniques, or chemical precursors or other 
chemicals when chemically synthesized. 

[0032] Specifically excluded from the definition of "isolated nucleic acids" are: 
naturally-occurring chromosomes (such as chromosome spreads), artificial chromosome 
libraries, genomic libraries, and cDNA libraries that exist either as an in vitro nucleic acid 
preparations or as a transfected/transfonned host cell preparation, wherein the host cells are 
either an in vitro heterogeneous preparation or plated as a heterogeneous population of single 
colonies. Also specifically excluded are the above libraries wherein a specified nucleic acid 
makes up less than 5% of the number of nucleic acid inserts in the vector molecules. Further 
specifically excluded are whole cell genomic DNA or whole cell RNA preparations 
(including whole cell preparations that are mechanically sheared or enzymatically digested). 
Even further specifically excluded are the whole cell preparations found as either an in vitro 
preparation or as a heterogeneous mixture separated by electrophoresis wherein the nucleic 
acid of the invention has not further been separated from the heterologous nucleic acids in the 
electrophoresis medium (e.g., further separating by excising a single band from a 
heterogeneous band population in an agarose gel or nylon blot). 

[0033] A nucleic acid molecule of the present invention, or a portion thereof, can be 
isolated using standard molecular biology techniques and the sequence information provided 
herein. For example, a Rfo cDNA can be isolated from a Raphanus sativum library using all 
or a portion of the sequence herein. Moreover, a nucleic acid molecule encompassing all or a 
portion of sequence herein can be isolated by the polymerase chain reaction using 
oligonucleotide primers designed based upon this sequence. For example, mRNA can be 
isolated from radish cells (e.g., by the guanidinium-thiocyanate extraction procedure of 
Chirgwin et al., 1979 Biochemistry 18:5294-5299) and cDNA can be prepared using reverse 
transcriptase (e.g., Moloney MLV reverse transcriptase, available from Gibco/BRL, 
Bethesda, MD; or AMV reverse transcriptase, available from Seikagaku America, Inc., St 
Petersburg, FL). Synthetic oligonucleotide primers for polymerase chain reaction 
amplification can be designed based upon the nucleotide sequence shown herein. A nucleic 
acid molecule of the invention can be amplified using cDNA or, alternatively, genomic DNA, 
as a template and appropriate oligonucleotide primers according to standard PCR 
amplification techniques. The nucleic acid molecule so amplified can be cloned into an 
appropriate vector and characterized by DNA sequence analysis. Furthermore, 
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oligonucleotides corresponding to a Rfo nucleotide sequence can be prepared by standard 
synthetic techniques, e.g., using an automated DNA synthesizer. 

[0034] In a preferred embodiment, an isolated nucleic acid molecule of the invention 
comprises one of the nucleotide sequences shown herein. It is to be understood that 
sequences shown herein comprise whole genomic fragments isolated from genomic DNA. 
Accordingly, SEQ ID NO:87, for example contains both coding regions and 5' and 3* 
untranslated regions that can include promoters and other regulatory sequences. 
Alternatively, the nucleic acid molecules of the present invention can comprise only the 
coding region of SEQ ID NO:87. A coding region of these sequences is indicated as an 
"ORF position". The present invention also includes Rfo coding nucleic acids that encode 
Rfo proteins as described herein. 

[0035] Moreover, the nucleic acid molecule of the invention can comprise only a portion 
of the coding region of the sequences shown herein, for example, a fragment which can be 
used as a probe or primer or a fragment encoding a biologically active portion of a Rfo 
protein. The nucleotide sequences determined from the cloning of the Rfo genes from 
Raphanus sativum allow for the generation of probes and primers designed for use in 
identifying and/or cloning Rfo homologs in other cell types and organisms, as well as Rfo 
homologs from other radishes and related species. 

[0036] Portions of proteins encoded by the Rfo nucleic acid molecules of the invention are 
preferably biologically active portions of one of the Rfo proteins described herein. As used 
herein, the term biologically active portion of a Rfo protein is intended to include a portion, 
e.g., a domain/motif, of a Rfo that participates in the restoration of fertility in a cytoplasmic 
male-sterile plant. In a preferred embodiment, the biologically active portion of a Rfo protein 
comprises one or more PPR motifs as described above. To determine whether a Rfo protein, 
or a biologically active portion thereof, can restore fertility in a cytoplasmic male-sterile 
plant, a fertility analysis of a plant comprising the Rfo protein may be performed. Such 
analysis methods are well known to those skilled in the art. More specifically, nucleic acid 
fragments encoding biologically active portions of a Rfo protein can be prepared by isolating 
a portion of sequences shown herein, introducing the isolated portion of nucleic acid into a 
male-sterile plant and assessing whether male-fertility is restored. A determination as to 
whether male-fertility is restored in a plant can be made, for example, by 1) visually 
assessing an increase in the production of pollen as compared to a male-sterile plant or 2) 
determining that the plant can self-fertilize as evidenced by placing a bag over a flower on the 
plant and finding an increase of seed therein as compared to a male-sterile plant. It is to be 
understood that a male-sterile plant containing an ogu cytoplasmic male sterility determinant 
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can produce a small amount of pollen. In one embodiment of the present invention, 
restoration of fertility in a male-sterile plant is indicated by an increase in the plant's pollen 
production by at least 95%. 

[00371 Biologically active portions of a Rfo proteins are encompassed by the present 
invention and include peptides comprising amino acid sequences derived from the amino acid 
sequence of a Rfo protein, or the amino acid sequence of a protein homologous to a Rfo 
protein, which includes fewer amino acids than a full length Rfo protein or the full length 
protein which is homologous to a Rfo protein, and exhibit at least one activity of a Rfo 
protein. Typically, biologically active portions (e.g., peptides which are, for example, 5, 10, 
15, 20, 30, 35, 36, 37, 38, 39, 40, 50, 100 or more amino acids in length) comprise a domain 
or motif with at least one activity of a Rfo protein. Moreover, other biologically active 
portions in which other regions of the protein are deleted, can be prepared by recombinant 
techniques and evaluated for one or more of the activities described herein. Preferably, the 
biologically active portions of a Rfo protein include one or more PPR domains/motifs or 
portions thereof and are able to restore fertility in a cytoplasmic male-sterile plant. 
[0038] The invention also provides Rfo chimeric or fusion proteins. As used herein, a 
Rfo "chimeric protein" or "fusion protein" comprises a Rfo polypeptide operatively linked to 
a non-Rfo polypeptide. A Rfo polypeptide refers to a polypeptide having an amino acid 
sequence corresponding to a Rfo protein, whereas a non-Rfo polypeptide refers to a 
polypeptide having an amino acid sequence corresponding to a protein which is not 
substantially homologous to the Rfo, e.g., a protein that is different from the Rfo and is 
derived from the same or a different organism. Within the fusion protein, the term 
"operatively linked" is intended to indicate that the Rfo polypeptide and the non-Rfo 
polypeptide are fused to each other so that both sequences fulfill the proposed function 
attributed to the sequence used. The non-Rfo polypeptide can be fused to the N-terminus or 
C-terminus of the Rfo polypeptide. For example, in one embodiment, the fusion protein is a 
GST-Rfo fusion protein in which the Rfo sequences are fused to the C-terminus of the GST 
sequences. Such fusion proteins can facilitate the purification of recombinant Rfo proteins. 
In another embodiment, the fusion protein is a Rfo protein containing a heterologous signal 
sequence at its N-tenninus. 

[0039] Preferably, a Rfo chimeric or fusion protein of the invention is produced by 
standard recombinant DNA techniques. For example, DNA fragments coding for the 
different polypeptide sequences are ligated together in-frame in accordance with conventional 
techniques, for example by employing blunt-ended or stagger-ended termini for ligation, 
restriction enzyme digestion to provide for appropriate termini, filling-in of cohesive ends as 
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appropriate, alkaline phosphatase treatment to avoid undesirable joining and enzymatic 
ligation. In another embodiment, the fusion gene can be synthesized by conventional 
techniques including automated DNA synthesizers. Alternatively, PCR amplification of gene 
fragments can be carried out using anchor primers that give rise to complementary overhangs 
between two consecutive gene fragments which can subsequently be annealed and re- 
amplified to generate a chimeric gene sequence (see, for example, Current Protocols in 
Molecular Biology, Eds. Ausubel et al. John Wiley & Sons: 1992). Moreover, many 
expression vectors are commercially available that already encode a fusion moiety (e.g., a 
GST polypeptide). A Rfo encoding nucleic acid can be cloned into such an expression vector 
such that the fusion moiety is linked in-frame to the Rfo protein. 

[0040] In addition to fragments and fusion proteins of the Rfo proteins described herein, 
the present invention includes homologs and analogs of naturally occurring Rfo proteins and 
Rfo encoding nucleic acids in a plant "Homologs" are defined herein as two nucleic acids or 
proteins that have similar, or homologous", nucleotide or amino acid sequences, 
respectively. Homologs include allelic variants, orthologs, paralogs, agonists and antagonists 
of Rfos as defined hereafter. The term "homolog" further encompasses nucleic acid 
molecules that differ from the nucleotide sequence shown herein (and portions thereof) due to 
degeneracy of the genetic code and thus encode the same Rfo protein as that encoded by the 
nucleotide sequences shown herein. As used herein a "naturally occurring" Rfo protein refers 
to a Rfo amino acid sequence that occurs in nature. 

[0041] Nucleic acid molecules corresponding to natural homologs such as allelic variants, 
orthologs and paralogs and natural analogs of a Rfo cDNA can be isolated based on their 
identity to the Raphanus sativum Rfo nucleic acids described herein. These natural homologs 
and analogs can be isolated using Rfo cDNAs, or a portion thereof, as a hybridization probe 
according to standard hybridization techniques under stringent hybridization conditions. In 
an alternative embodiment, homologs of the Rfo protein can be identified by screening 
combinatorial libraries of mutants, e.g., truncation mutants, of the Rfo nucleic acids for Rfo 
protein agonist or antagonist activity. In one embodiment, a variegated library of Rfo variants 
is generated by combinatorial mutagenesis at the nucleic acid level and is encoded by a 
variegated gene library. A variegated library of Rfo variants can be produced by, for 
example, enzymatically ligating a mixture of synthetic oligonucleotides into gene sequences 
such that a degenerate set of potential Rfo sequences is expressible as individual 
polypeptides, or alternatively, as a set of larger fusion proteins (e.g., for phage display) 
containing the set of Rfo sequences therein. There are a variety of methods that can be used 
to produce libraries of potential Rfo homologs from a degenerate oligonucleotide sequence. 
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Chemical synthesis of a degenerate gene sequence can be performed in an automatic DNA 
synthesizer, and the synthetic gene is then ligated into an appropriate expression vector. Use 
of a degenerate set of genes allows for the provision, in one mixture, of all of the sequences 
encoding the desired set of potential Rfo sequences. Methods for synthesizing degenerate 
oligonucleotides are known in the art (see, e.g., Narang, S.A., 1983 Tetrahedron 39:3; Itakura 
et al., 1984 Annu. Rev. Biochem. 53:323; Itakura et al., 1984 Science 198:1056; Ike et al., 
1983 Nucleic Acid Res. 11:477). 

[0042] In addition, libraries of fragments of the Rfo coding regions can be used to 
generate a variegated population of Rfo fragments for screening and subsequent selection of 
homologs of a Rfo. In one embodiment, a library of coding sequence fragments can be 
generated by treating a double stranded PCR fragment of a Rfo coding sequence with a 
nuclease under conditions wherein nicking occurs only about once per molecule, denaturing 
the double stranded DNA, renaturing the DNA to form double stranded DNA, which can 
include sense/antisense pairs from different nicked products, removing single stranded 
portions from reformed duplexes by treatment with SI nuclease, and ligating the resulting 
fragment library into an expression vector. By this method, an expression library can be 
derived which encodes N-terminal, C-terminal and internal fragments of various sizes of the 
Rfo proteins. 

[0043] Several techniques are known in the art for screening gene products of 
combinatorial libraries made by point mutations or truncation, and for screening cDNA 
libraries for gene products having a selected property. Such techniques are adaptable for 
rapid screening of the gene libraries generated by the combinatorial mutagenesis of Rfo 
homologs. The most widely used techniques, which are amenable to high through-put 
analysis, for screening large gene libraries typically include cloning the gene library into 
replicable expression vectors, transforming appropriate cells with the resulting library of 
vectors, and expressing the combinatorial genes under conditions in which detection of a 
desired activity facilitates isolation of the vector encoding the gene whose product was 
detected. Recursive ensemble mutagenesis (REM), a new technique that enhances the 
frequency of functional mutants in the libraries, can be used in combination with the 
screening assays to identify Rfo homologs (Arkin and Yourvan, 1992 PNAS 89:7811-7815; 
Delgrave et al., 1993 Protein Engineering 6(3):327-331). In another embodiment, cell based 
assays can be exploited to analyze a variegated Rfo library, using methods well known in the 
art. The present invention further provides a method of identifying a novel Rfo protein, 
comprising (a) raising a specific antibody response to a Rfo protein, or a fragment thereof; as 
described above; (b) screening putative Rfo protein material with the antibody, wherein 
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specific binding of the antibody to the material indicates the presence of a potentially novel 
Rfo protein; and (c) analyzing the bound material in comparison to known Rfo proteins, to 
determine its novelty. 

[0044] Preferably, the above described Rfo homologs retain the same biological activity 
as the Rfo proteins shown herein, and more preferably, the Rfo homologs restore fertility in a 
cytoplasmic male-sterile plant. To determine the percent homology of two amino acid 
sequences, the sequences are aligned for optimal comparison purposes (e.g., gaps can be 
introduced in the sequence of one protein for optimal alignment with the other protein). The 
amino acid residues at corresponding amino acid positions are then compared. When a 
position in one sequence is occupied by the same amino acid residue as the corresponding 
position in the other sequence, then the molecules are homologous at that position (i.e., as 
used herein amino acid or nucleic acid "homology" is equivalent to amino acid or nucleic 
acid "identity"). The same type of comparison can be made between two nucleic acid 
sequences. 

[0045] The percent homology between the two sequences is a function of the number of 
identical positions shared by the sequences (i.e., % homology = numbers of identical 
positions/total numbers of positions x 100). Preferably, the isolated Rfo protein homologs 
included in the present invention are at least about 50-60%, preferably at least about 60-70%, 
and more preferably at least about 70-80%, 80-90%, 90-95%, and most preferably at least 
about 96%, 97%, 98%, 99% or more homologous to an entire amino acid sequence shown in 
herein. In yet another embodiment, the isolated Rfo protein homologs included in the present 
invention are at least about 50-60%, preferably at least about 60-70%, and more preferably at 
least about 70-80%, 80-90%, 90-95%, and most preferably at least about 96%, 97%, 98%, 
99% or more homologous to an entire amino acid sequence encoded by a nucleic acid 
sequence shown herein. In other embodiments, the isolated Rfo protein homologs have 
homology over at least 15 contiguous amino acid residues, more preferably at least 25 
contiguous amino acid residues, and most preferably at least 35 contiguous amino acid 
residues of the sequences shown herein. In a further preferred embodiment, the Rfoo 
homologs have greater than 90% homology over the PPR motif. 

[0046] In another preferred embodiment, an isolated Rfo nucleic acid homolog of the 
invention comprises a nucleotide sequence which is at least about 50-60%, preferably at least 
about 60-70%, more preferably at least about 70-80%, 80-90%, or 90-95%, and even more 
preferably at least about 95%, 96%, 97%, 98%, 99% or more homologous to a nucleotide 
sequence shown herein, or a portion thereof. The preferable length of sequence comparison 
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for nucleic acids is at least 75 nucleotides, more preferably at least 100 nucleotides and most 
preferably the entire coding region of the nucleic acid 

[0047] With regard to the present invention, a determination of the percent homology 
between two sequences is accomplished using a mathematical algorithm. In a preferred 
embodiment of the present invention, the percent homology between two sequences is 
determined using the mathematical algorithm of Karlin and Altschul (1990 Proc. Natl. Acad. 
Sci. USA 90:5873-5877). Such an algorithm is incorporated into the NBLAST and XBLAST 
programs of Altschul, et aL (1990 J. Mol. Biol. 215:403-410). Accordingly, the present 
invention includes a Rfo nucleic acid homolog having at least 50% homology with the 
nucleotide sequence shown herein as determined using the NBLAST program, score=100, 
wordlength=12. Additionally, the present invention includes a Rfo amino acid homolog 
having at least 70% homology with the amino acid sequence shown herein as determined 
using the XBLAST program, score=50, wordlength=3. When BLAST programs are used to 
determine percent homology, Gapped BLAST is utilized as described in Altschul et al. (1997 
Nucleic Acids Res. 25:3389-3402). When utilizing BLAST and Gapped BLAST programs, 
the default parameters of the respective programs (e.g., XBLAST and NBLAST) are used. 
[0048] In another embodiment of the present invention, the percent homology between 
two sequences is determined using the mathematical algorithm of Smith and Waterman. In 
yet another embodiment, the percent homology between two sequences is determined using 
the mathematical algorithm of Myers and Miller (CABIOS 1989). The Myers and Miller 
algorithm is incorporated into the ALIGN program (version 2.0) that is part of the GCG 
sequence alignment software package. When utilizing the ALIGN program for comparing 
amino acid sequences, a PAM120 weight residue table, a gap length penalty of 12 and a gap 
penalty of 4 is used to obtain Rfo amino acid homologs. 

[00491 Finally, homology between nucleic acid sequences can be determined using 
hybridization techniques known to those of skill in the art. Accordingly, an isolated Rfo 
nucleic acid molecule of the invention comprises a nucleotide sequence which hybridizes, 
e.g., hybridizes under stringent conditions, to the nucleotide sequence shown herein or a 
portion thereof. More particularly, an isolated nucleic acid molecule of the invention is at 
least 15 nucleotides in length and hybridizes under stringent conditions to the nucleic acid 
molecule comprising a nucleotide sequence of herein. In other embodiments, the nucleic acid 
is at least 30, 50, 100, 250 or more nucleotides in length. Preferably, an isolated nucleic acid 
homolog of the invention comprises a nucleotide sequence which hybridizes under highly 
stringent conditions to the nucleotide sequence shown herein and restores fertility when 
expressed in a cytoplasmic male-sterile plant. 
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[0050] As used herein with regard to hybridization, the term "stringent conditions" refers 
to 6X sodium chloride/sodium citrate (SSC) at about 45°C, followed by one or more washes 
in 0.2 to 0.5 X SSC, 0.1 to 0.5% SDS at 50 to 68°C. Additionally, the term "highly stringent 
conditions" refers to 6X SSC at about 45°C, followed by one or more washes in 0.5 X SSC, 
0.5% SDS at 68°C. Preferably, an isolated nucleic acid molecule of the invention that 
hybridizes under stringent or highly stringent conditions to a sequence herein corresponds to 
a naturally occurring nucleic acid molecule. As used herein, a "naturally occraring" nucleic 
acid molecule refers to an RNA or DNA molecule having a nucleotide sequence that occurs 
in nature (e.g., encodes a natural protein). In one embodiment, the nucleic acid encodes a 
naturally occurring Raphanus sativum Rfo protein. 

10051] Using the above-described methods, and others known to those of skill in the art, 
one of ordinary skill in the art can isolate homologs of the Rfo nucleic acids comprising a 
nucleotide sequence shown in SEQ ID NO: 1 and Rfo proteins comprising an amino acid 
sequence shown in SEQ ID NO:4, SEQ ID NO:5 or SEQ ID NO:6. One subset of these 
homologs comprises allelic variants. As used herein, the term "allelic variant" refers to a 
nucleotide sequence containing polymorphisms that lead to changes in the amino acid 
sequences of a Rfo protein and that exist within a natural population (e.g., a plant species or 
variety). Such natural allelic variations can typically result in 1-5% variance in a Rfo nucleic 
acid. Allelic variants can be identified by sequencing the nucleic acid sequence of interest in 
a number of different radish plants, which can be readily carried out by using hybridization 
probes to identify the same Rfo genetic locus in those radish plants. Any and all such nucleic 
acid variations and resulting amino acid polymorphisms or variations in a Rfo protein that are 
the result of natural allelic variation and that do not alter the functional activity of a Rfo 
protein, are intended to be within the scope of the invention. 

[0052] Moreover, nucleic acid molecules encoding Rfo proteins from the same or other 
species such as Rfo analogs, orthologs and paralogs, are intended to be within the scope of the 
present invention. As used herein, the term "analogs" refers to two nucleic acids that have 
the same or similar function, but that have evolved separately in unrelated organisms. As 
used herein, the term "orthologs" refers to two nucleic acids from different species, but that 
have evolved from a common ancestral gene by speciation. Normally, orthologs encode 
proteins having the same or similar functions. As also used herein, the term "paralogs" refers 
to two nucleic acids that are related by duplication within a genome. Paralogs usually have 
different functions, but these functions may be related (Tatusov, R.L. et aL 1997 Science 
278(5338):631-637). 
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[0053] Analogs, orthologs and paralogs of a naturally occurring Rfo nucleic acids can 
encode proteins that differ from a naturally occurring Rfo protein by post-translational 
modifications, by amino acid sequence differences, or by both. Post-translational 
modifications include in vivo and in vitro chemical derivatization of polypeptides, e.g., 
acetylation, carboxylation, phosphorylation, or glycosylation, and such modifications may 
occur during polypeptide synthesis or processing or following treatment with isolated 
modifying enzymes. Li particular, orthologs of the invention will generally exhibit at least 
80-85%, more preferably 90%, and most preferably 95%, 96%, 97%, 98% or even 99% 
identity or homology with aU or part of a naturally occurring Rfo amino acid sequence and 
will exhibit a function similar to a Rfo protein. Preferably, a Rfo ortholog of the present 
invention restores fertility in a cytoplasmic male-sterile plant. More preferably, a Rfo 
ortholog restores fertility in a cytoplasmic male-sterile Brassica napus plant. 

[0054] In addition to naturally occurring variants of a Rfo sequence that may exist in the 
population, the skilled artisan will further appreciate that changes can be introduced by 
mutation into a nucleotide sequence shown herein, thereby leading to changes in the amino 
acid sequence of the encoded Rfo protein, without altering the functional activity of the Rfo 
protein. For example, nucleotide substitutions leading to amino acid substitutions at "non- 
essential" amino acid residues can be made in the sequences. A <s non-essential" amino acid 
residue is a residue that can be altered from the wild-type sequence of one of the Rfo proteins 
without altering the activity of said Rfo protein, whereas an "essential" amino acid residue is 
required for Rfo protein activity. Other amino acid residues, however, (e.g., those not within 
the PPR motif described above) may not be essential for activity and thus are likely to be 
amenable to alteration without altering Rfoo activity. 

[0055] Accordingly, an isolated nucleic acid molecule encoding a Rfo protein 
homologous to a protein sequence herein can be created by introducing one or more 
nucleotide substitutions, additions or deletions into a nucleotide sequence such that one or 
more amino acid substitutions, additions or deletions are introduced into the encoded protein. 
Mutations can be introduced into one of the sequences by standard techniques, such as site- 
directed mutagenesis and PCR-mediated mutagenesis. Preferably, conservative amino acid 
substitutions are made at one or more predicted non-essential amino acid residues. A 
"conservative amino acid substitution" is one in which the amino acid residue is replaced 
with an amino acid residue having a similar side chain. 

[0056] Families of amino acid residues having similar side chains have been defined in 
the art. These families include amino acids with basic side chains (e.g., lysine, arginine, 
histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains 
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(e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side 
chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, 
tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side 
chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential 
amino acid residue in a Rfo is preferably replaced with another amino acid residue from the 
same side chain family. Alternatively, in another embodiment, mutations can be introduced 
randomly along all or part of a Rfo coding sequence, such as by saturation mutagenesis, and 
the resultant mutants can be screened for a Rfo activity described herein to identify mutants 
that retain Rfo activity. Following mutagenesis of the sequence, the encoded protein can be 
expressed and the activity of the protein can be determined by analyzing the viable pollen 
production of a plant expressing the protein as described above. For example, a Rfo mutant 
that retains activity restores fertility in a male-sterile plant upon its expression in the plant. 
[0057] Additionally, optimized Rfo nucleic acids can be created. Preferably, an optimized 
Rfo nucleic acid encodes a Rfo that restores fertility in a cytoplasmic male-sterile plant, and 
more particularly cytoplasmic male-sterile Brassica napus plant. As used herein, 
"optimized" refers to a nucleic acid that is genetically engineered to increase its expression in 
a given plant or animal. To provide plant optimized Rfo nucleic acids, the DNA sequence of 
the gene can be modified to 1) comprise codons preferred by highly expressed plant genes; 2) 
comprise an A+T content in nucleotide base composition to that substantially found in plants; 
3) form a plant initiation sequence, 4) eliminate sequences that cause destabilization, 
inappropriate polyadenylation, degradation and termination of RNA, or that form secondary 
structure hairpins or RNA splice sites. Increased expression of Rfo nucleic acids in plants can 
be achieved by utilizing the distribution frequency of codon usage in plants in general or a 
particular plant such as Brassica napus. 

[0058] As used herein, "frequency of preferred codon usage" refers to the preference 
exhibited by a specific host cell in usage of nucleotide codons to specify a given amino acid. 
To determine the frequency of usage of a particular codon in a gene, the number of 
occurrences of that codon in the gene is divided by the total number of occurrences of all 
codons specifying the same amino acid in the gene. Similarly, the frequency of preferred 
codon usage exhibited by a host cell can be calculated by averaging the frequency of 
preferred codon usage in a large number of genes expressed by the host cell. It is preferable 
that this analysis be limited to genes that are highly expressed by the host cell. The percent 
deviation of the frequency of preferred codon usage for a synthetic gene from that employed 
by a host cell is calculated first by determining the percent deviation of the frequency of 
usage of a single codon from that of the host cell followed by ob taining the average deviation 
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over all codons. As defined herein, this calculation includes unique codons (i.e., ATG and 
TGG). In general terms, the overall average deviation of the codon usage of an optimized 
gene from that of a host cell is calculated using the equation lA = n= lZXn-Y n Xn times 
100 Z where X„ = frequency of usage for codon n in the host cell; Y n = frequency of usage 
for codon n in the synthetic gene, n represents an individual codon that specifies an amino 
acid and the total number of codons is Z. The overall deviation of the frequency of codon 
usage, A, for all amino acids should preferably be less than about 25%, and more preferably 
less than about 10%. 

[0059] Hence, a Rfo nucleic acid can be optimized such that its distribution frequency of 
codon usage deviates, preferably, no more than 25% from that of highly expressed plant 
genes and, more preferably, no more than about 10%. In addition, consideration is given to 
the percentage G+C content of the degenerate third base (monocotyledons appear to favor 
GH-C in this position, whereas dicotyledons do not). It is also recognized that the XCG 
(where X is A, T, C, or G) nucleotide is the least preferred codon in dicots whereas the XTA 
codon is avoided in both monocots and dicots. Optimized Rfo nucleic acids of this invention 
also preferably have CG and TA doublet avoidance indices closely approximating those of 
the chosen host plant (i.e., Brassica napus). More preferably these indices deviate from that 
of the host by no more than about 10-15%. 

[0060] In addition to the nucleic acid molecules encoding the Rfo's described above, 
another aspect of the invention pertains to isolated nucleic acid molecules that are antisense 
thereto. An "antisense" nucleic acid comprises a nucleotide sequence that is complementary 
to a "sense" nucleic acid encoding a protein, e.g., complementary to the . coding strand of a 
double-stranded cDNA molecule or complementary to an mRNA sequence. Accordingly, an 
antisense nucleic acid can hydrogen bond to a sense nucleic acid. The antisense nucleic acid 
can be complementary to an entire Rfo coding strand, or to only a portion thereof. In one 
embodiment, an antisense nucleic acid molecule is antisense to a "coding region" of the 
coding strand of a nucleotide sequence encoding a Rfo. The term "coding region" refers to 
the region of the nucleotide sequence comprising codons that are translated into amino acid 
residues (e.g., the entire coding region of „,„ comprises nucleotides 1 to ....). In another 
embodiment, the antisense nucleic acid molecule is antisense to a "noncoding region" of the 
coding strand of a nucleotide sequence encoding a Rfo. The term "noncoding region" refers 
to 5' and 3 9 sequences that flank the coding region that are not translated into amino acids 
(i.e., also referred to as 5 9 and 3 9 untranslated regions). 

[0061] In a preferred embodiment, an isolated nucleic acid molecule of the invention 
comprises a nucleic acid molecule which is a complement of the nucleotide sequence shown 
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herein, or a portion thereof, A nucleic acid molecule that is complementary to the nucleotide 
sequence shown herein is one which is sufficiently complementary to the nucleotide sequence 
shown such that it can hybridize to the nucleotide sequence shown, thereby forming a stable 
duplex. 

[0062] Given the coding strand sequences encoding the Rfo's disclosed herein, antisense 
nucleic acids of the invention can be designed according to the rules of Watson and Crick 
base pairing. The antisense nucleic acid molecule can be complementary to the entire coding 
region of Rfo mRNA, but more preferably is an oligonucleotide which is antisense to only a 
portion of the coding or noncoding region of Rfo mRNA. For example, the antisense 
oligonucleotide can be complementary to the region surrounding the translation start site of 
Rfo mRNA. An antisense oligonucleotide can be, for example, about 5, 10, 15, 20, 25, 30, 
35, 40, 45 or 50 or more nucleotides in length. 

[0063] An antisense nucleic acid of the invention can be constructed using chemical 
synthesis and enzymatic ligation reactions using procedures known in the art. For example, 
an antisense nucleic acid (e.g., an antisense oligonucleotide) can be chemically synthesized 
using naturally occurring nucleotides or variously modified nucleotides designed to increase 
the biological stability of the molecules or to increase the physical stability of the duplex 
formed between the antisense and sense nucleic acids, e.g., phosphorothioate derivatives and 
acridine substituted nucleotides can be used. Examples of modified nucleotides which can be 
used to generate the antisense nucleic acid include 5-fluorouracil, 5-bromouracil, 5- 
chlorouracil, 5-iodouracil, hypoxanthine, xanthine, 4-acetylcytosine, 5- 
(carboxyhydroxylmethyl) uracil, 5-carboxymethylaminomethyl-2-thiouridine, 5- 
carboxymethylaminomethyluracil, dihydrouracil, beta-D-galactosylqueosine, inosine, N6- 
isopentenyladenine, 1-methylguanine, 1-methylinosine, 2,2-dimethylguanine, 2- 
methyladenine, 2-methylguanine, 3-methylcytosine, 5-methylcytosine, N6-adenine, 7- 
methylguanine, 5-methylaminomethyluracil, 5-methoxyaminomethyl-2-thiouracil, beta-D- 
mannosylqueosine, S'-methoxycarboxymethyluracil, 5-methoxyuracil, 2-methylthio~N6- 
isopentenyladenine, uracil-5-oxyacetic acid (v), wybutoxosine, pseudouracil, queosine, 2- 
thiocytosine, 5-me%l-2-thiouracil, 2-thiouracil, 4-thiouracil, 5-methyluracil, uracil-5- 
oxyacetic acid methylester, uracil-5-oxyacetic acid (v), 5-methyl-2-thiouracil, 3-(3-amino-3- 
N-2-carboxypropyl) uracil, (acp3)w, and 2,6-diaminopurine. Alternatively, the antisense 
nucleic acid can be produced biologically using an expression vector into which a nucleic 
acid has been subcloned in an antisense orientation (i.e., RNA transcribed from the inserted 
nucleic acid will be of an antisense orientation to a target nucleic acid of interest, described 
further in the following subsection). 
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[0064] The antisense nucleic acid molecules of the invention are typically administered to 
a cell or generated in situ such that they hybridize with or bind to cellular mRNA and/or 
genomic DNA encoding a Rfo to thereby inhibit expression of the protein, e.g., by inhibiting 
transcription and/or translation. The hybridization can be by conventional nucleotide 
complementarity to form a stable duplex, or, for example, in the case of an antisense nucleic 
acid molecule which binds to DNA duplexes, through specific interactions in the major 
groove of the double helix. The antisense molecule can be modified such that it specifically 
binds to a receptor or an antigen expressed on a selected cell surface, e.g., by linking the 
antisense nucleic acid molecule to a peptide or an antibody which binds to a cell surface 
receptor or antigen. The antisense nucleic acid molecule can also be delivered to cells using 
the vectors described herein. To achieve sufficient intracellular concentrations of the 
antisense molecules, vector constructs in which the antisense nucleic acid molecule is placed 
under the control of a strong prokaryotic, viral, or eukaryotic (including plant) promoter are 
preferred. 

[0065] In yet another embodiment, the antisense nucleic acid molecule of the invention is 
an a-anomeric nucleic acid molecule. An oc-anomeric nucleic acid molecule forms specific 
double-stranded hybrids with complementary RNA in which, contrary to the usual (J-units, 
the strands run parallel to each other (Gaultier et al., 1987 Nucleic Acids. Res. 15:6625- 
6641). The antisense nucleic acid molecule can also comprise a 2'-o-methylribonucleotide 
(Inoue et al., 1987 Nucleic Acids Res. 15:6131-6148) or a chimeric RNA-DNA analogue 
(Inoue et al., 1987 FEBS Lett. 215:327-330). 

[0066] In still another embodiment, an antisense nucleic acid of the invention is a 
ribozyme. Ribozymes are catalytic RNA molecules with ribonuclease activity that are 
capable of cleaving a single-stranded nucleic acid, such as an mRNA, to which they have a 
complementary region. Thus, ribozymes (e.g., hammerhead ribozymes described in 
Haselhoff and Gerlach, 1988 Nature 334:585-591) can be used to catalytically cleave Rfo 
mRNA transcripts to thereby inhibit translation of Rfo mRNA. A ribozyme having 
specificity for a Rfo-encoding nucleic acid can be designed based upon the nucleotide 
sequence of a Rfo cDNA, corresponding to an ORF of a Rfo nucleic acid provided herein or 
on the basis of a heterologous sequence to be isolated according to methods taught in this 
invention. For example, a derivative of a Tetrahymena L-19 IVS RNA can be constructed in 
which the nucleotide sequence of the active site is complementary to the nucleotide sequence 
to be cleaved in a Rfo-encoding mRNA. See, e.g., Cech et al. U.S. Patent No. 4,987,071 and 
Cech et al. U.S. Patent No. 5,116,742. Alternatively, Rfo mRNA can be used to select a 
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catalytic RNA having a specific ribonuclease activity from a pool of RNA molecules. See, 
e.g., Bartel, D. and Szostak, J.W., 1993 Science 261:1411-1418. 

[0067] Alternatively, Rfo gene expression can be inhibited by targeting nucleotide 
sequences complementary to the regulatory region of a Rfo nucleotide sequence (e.g., a Rfo 
promoter and/or enhancer) to form triple helical structures that prevent transcription of a Rfo 
gene in target cells. See generally, Helene, C, 1991 Anticancer Drug Des. 6(6):569-84; 
Helene, C. et al., 1992 Ann. N.Y. Acad. Sci. 660:27-36; and Maher, L.J., 1992 Bioassays 
14(12):807-15. 

[0068] In addition to the Rfo nucleic acids and proteins described above, the present 
invention encompasses these nucleic acids and proteins attached to a moiety. These moieties 
include, but are not limited to, detection moieties, hybridization moieties, purification 
moieties, delivery moieties, reaction moieties, binding moieties, and the like. A typical group 
of nucleic acids having moieties attached includes probes and primers. Probes and primers 
typically comprise a substantially isolated oligonucleotide. The oligonucleotide typically 
comprises a region of nucleotide sequence that hybridizes under stringent conditions to at 
least about 12, preferably about 25, more preferably about 40, 50 or 75 consecutive 
nucleotides of a sense strand of the sequence set forth in the sequences, an anti-sense 
sequence of the sequence set forth in the sequences, or naturally occurring mutants thereof. 
Primers based on a nucleotide sequences herein can be used in PCR reactions to clone Rfo 
homologs. Probes based on the Rfo nucleotide sequences can be used to detect transcripts or 
genomic sequences encoding the same or homologous proteins. In preferred embodiments, 
the probe further comprises a label group such as a radioisotope, a fluorescent compound, an 
enzyme or an enzyme co-factor. Such probes can be used as a part of a genomic marker test 
kit for identifying cells which express a Rfo nucleic acid, such as by measuring a level of a 
i?/b-encoding nucleic acid, in a sample of cells, e.g., detecting Rfo mRNA levels or 
determining whether a genomic Rfo gene has been mutated or deleted. 

[0069] The invention further provides an isolated recombinant expression vector 
comprising a Rfo nucleic acid as described above, wherein expression of the vector in a host 
plant results in increased produced of viable pollen. As used herein, the term 'Vector'' refers 
to a nucleic acid molecule capable of transporting another nucleic acid to which it has been 
linked. One type of vector is a "plasmid", which refers to a circular double stranded DNA 
loop into which additional DNA segments can be ligated. Another type of vector is a viral 
vector, wherein additional DNA segments can be ligated into the viral genome. Certain 
vectors are capable of autonomous replication in a host cell into which they are introduced 
(e.g., bacterial vectors having a bacterial origin of replication and episomal mammalian 
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vectors). Other vectors (e.g., non-episomal mammalian vectors) are integrated into the 
genome of a host cell upon introduction into the host cell, and thereby are replicated along 
with the host genome. Moreover, certain vectors are capable of directing the expression of 
genes to which they are operatively linked. Such vectors are referred to herein as "expression 
vectors". In general, expression vectors of utility in recombinant DNA techniques are often 
in the form of plasmids. In the present specification, '^lasmid" and 'Vector" can be used 
interchangeably as the plasmid is the most commonly used form of vector. However, the 
invention is intended to include such other forms of expression vectors, such as viral vectors 
(e.g., replication defective retroviruses, adenoviruses and adeno-associated viruses), which 
serve equivalent functions. 

[0070] The recombinant expression vectors of the invention comprise a nucleic acid of 
the invention in a form suitable for expression of the nucleic acid in a host cell, which means 
that the recombinant expression vectors include one or more regulatory sequences, selected 
on the basis of the host cells to be used for expression, which is operatively linked to the 
nucleic acid sequence to be expressed. Within a recombinant expression vector, "operably 
linked" is intended to mean that the nucleotide sequence of interest is linked to the regulatory 
sequerice(s) in a manner which allows for expression of the nucleotide sequence (e.g., in an in 
vitro transcription/ translation system or in a host cell when the vector is introduced into the 
host cell). The term Regulatory sequence" is intended to include promoters, enhancers and 
other expression control elements (e.g., polyadenylation signals). Such regulatory sequences 
are described, for example, in Goeddel, Gene Expression Technology: Methods in 
Enzymology 185, Academic Press, San Diego, CA (1990) or see: Gruber and Crosby, in: 
Methods in Plant Molecular Biology and Biotechnology, eds. Glick and Thompson, Chapter 
7, 89-108, CRC Press: Boca Raton, Florida, including the references therein. Regulatory 
sequences include those that direct constitutive expression of a nucleotide sequence in many 
types of host cells and those that direct expression of the nucleotide sequence only in certain 
host cells or under certain conditions. It will be appreciated by those skilled in the art that the 
design of the expression vector can depend on such factors as the choice of the host cell to be 
transformed, the level of expression of protein desired, etc. The expression vectors of the 
invention can be introduced into host cells to thereby produce proteins or peptides, including 
fusion proteins or peptides, encoded by nucleic acids as described herein (e.g., Rfo proteins, 
mutant forms of Rfo proteins, fusion proteins, etc.). 

[0071] The recombinant expression vectors of the invention can be designed for 
expression of Rfo's in prokaryotic or eukaryotic cells. For example, Rfo genes can be 
expressed in multicellular plant cells (see Schmidt, R. and Willmitzer, L., 1988 High 
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efficiency Agrobacterium tumefaciens-mediated transformation of Arabidopsis ihaliana leaf 
and cotyledon explants, Plant Cell Rep. 583-586); Plant Molecular Biology and 
Biotechnology, C Press, Boca Raton, Florida, chapter 6/7, S.71-119 (1993); F.F. White, B. 
Jenes et al., Techniques for Gene Transfer, in: Transgenic Plants, Vol. 1, Engineering and 
Utilization, eds. Kung und R. Wu, 128-43, Academic Press: 1993; Potrykus, 1991 Annu. 
Rev. Plant Physiol. Plant Molec. Biol. 42:205-225 and references cited therein); C. 
glutamicum, insect cells (using baculovirus expression vectors), yeast and other fungal cells 
(see Romanos, M.A. et al., 1992 Foreign gene expression in yeast: a review, Yeast 8:423- 
488; van den Hondel, C.A.MJJ. et al., 1991 Heterologous gene expression in filamentous 
ftmgi, in: More Gene Manipulations in Fungi, J.W. Bennet & L.L. Lasure, eds., p. 396-428: 
Academic Press: San Diego; and van den Hondel, C.A.MJJ. & Punt, P. J., 1991 Gene 
transfer systems and vector development for filamentous fungi, in: Applied Molecular 
Genetics of Fungi, Peberdy, J.F. et al., eds., p. 1-28, Cambridge University Press: 
Cambridge), algae (Falciatore et al., 1999 Marine Biotechnology 1(3):239-251) or 
mammalian cells. Suitable host cells are discussed further in Goeddel, Gene Expression 
Technology: Methods in Enzymology 185, Academic Press: San Diego, CA (1990). 
Alternatively, the recombinant expression vector can be transcribed and translated in vitro, 
for example using T7 promoter regulatory sequences and T7 polymerase. 
[0072] Expression of proteins in prokaryotes is most often carried out with vectors 
containing constitutive or inducible promoters directing the expression of either fusion or 
non-fusion proteins. Fusion vectors add a number of amino acids to a protein encoded 
therein, usually to the amino terminus of the recombinant protein but also to the C-terminus 
or fused within suitable regions in the proteins. Such fusion vectors typically serve three 
purposes: 1) to increase expression of a recombinant protein; 2) to increase the solubility of a 
recombinant protein; and 3) to aid in the purification of a recombinant protein by acting as a 
ligand in affinity purification. Often, in fusion expression vectors, a proteolytic cleavage site 
is introduced at the junction of the fusion moiety and the recombinant protein to enable 
separation of the recombinant protein from the fusion moiety subsequent to purification of 
the fusion protein. Such enzymes, and their cognate recognition sequences, include Factor 
Xa, thrombin and enterokinase. 

[0073J Typical fusion expression vectors include pGEX (Pharmacia Biotech Inc; Smith, 
D.B. and Johnson, K.S., 1988 Gene 67:31-40), pMAL (New England Biolabs, Beverly, MA) 
and pRTT5 (Pharmacia, Piscataway, NJ) which fuse glutathione S-transferase (GST), maltose 
E binding protein, or protein A, respectively, to the target recombinant protein. In one 
embodiment, the coding sequence of the Rfo is cloned into a pGEX expression vector to 
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create a vector encoding a fusion protein comprising, from the N-terminus to the C-terminus, 
GST-thrombin cleavage site-X protein. The fusion protein can be purified by affinity 
chromatography using glutathione-agarose resin. Recombinant Rfo unfused to GST can be 
recovered by cleavage of the fusion protein with thrombin. 

[0074] Examples of suitable inducible non-fusion E. coli expression vectors include pTrc 
(Amann et al., 1988 Gene 69:301-315) and pET lid (Studier et al., Gene Expression 
Technology: Methods in Enzymology 185, Academic Press, San Diego, California (1990) 60- 
89). Target gene expression from the pTrc vector relies on host RNA polymerase 
transcription from a hybrid trp-lac fiision promoter. Target gene expression from the pET 
lid vector relies on transcription from a T7 gnlO-lac fusion promoter mediated by a co- 
expressed viral RNA polymerase (T7 gnl). This viral polymerase is supplied by host strains 
BL21(DE3) or HMS174(DE3) from a resident X prophage harboring a T7 gnl gene under the 
transcriptional control of the lacUV 5 promoter. 

[0075] In another embodiment, the Rfo expression vector is a yeast expression vector. 
Examples of vectors for expression in yeast S. cerevisiae include pYepSecl (Baldari, et al., 
1987 EMBO J. 6:229-234), pMFa (Kurjan and Herskowitz, 1982 Cell 30:933-943), pJRY88 
(Schultz et al., 1987 Gene 54:113-123), and pYES2 (Invitrogen Corporation, San Diego, 
CA). Vectors and methods for the construction of vectors appropriate for use in other fungi, 
such as the filamentous fungi, include those detailed in: van den Hondel, C. AM. J J. & Punt, 
PJ. (1991) "Gene transfer systems and vector development for filamentous fungi", in: 
Applied Molecular Genetics of Fungi, J.F. Peberdy, et al., eds., p. 1-28, Cambridge 
University Press: Cambridge. 

[0076] Alternatively, the Rfo of the invention can be expressed in insect cells using 
baculovirus expression vectors. Baculovirus vectors available for expression of proteins in 
cultured insect cells (e.g., Sf 9 cells) include the pAc series (Smith et al,, 1983 Mol. Cell Biol. 
3:2156-2165) and the pVL series (Lucklow and Summers, 1989 Virology 170:31-39). 

[0077] In another embodiment, the Rfo of the invention may be expressed in unicellular 
plant cells (such as algae) (see Falciatore et al., 1999 Marine Biotechnology 1(3):239-251 and 
references therein), and more preferably, plant cells from higher plants (e.g., the 
spermatophytes, such as crop plants). Examples of plant expression vectors include those 
detailed in: Becker, D., Kemper, E., Schell, J. and Masterson, R., 1992 New plant binary 
vectors with selectable markers located proximal to the left border, Plant Mol. Biol. 20: 1 195- 
1197; and Bevan, M.W., 1984 Binary Agrobacterium vectors for plant transformation, Nucl. 
Acid. Res. 12:8711-8721; Vectors for Gene Transfer in Higher Plants; in: Transgenic Plants, 
Vol. 1, Engineering and Utilization, eds.: Kung and R Wu, Academic Press, 1993, S. 15-38. 

28 



WO 03/006622 PCT/US02/22217 

[0078] A plant expression cassette preferably contains regulatory sequences capable of 
driving gene expression in plant cells and operably linked so that each sequence can fulfill its 
function, for example, termination of transcription by polyadenylation signals. Preferred 
polyadenylation signals are those originating from Agrobacterium tumefaciens t-DNA such 
as the gene 3 known as octopine synthase of the Ti-plasmid pTiACH5 (Gielen et al., 1984 
EMBO J. 3:835) or functional equivalents thereof but also all other terminators functionally 
active in plants are suitable. As plant gene expression is very often not limited on 
transcriptional levels, a plant expression cassette preferably contains other operably linked 
sequences like translational enhancers such as the overdrive-sequence containing the 5'- 
untranslated leader sequence from tobacco mosaic virus enhancing the protein per RNA ratio 
(Gallie et al., 1987 Nucl. Acids Research 15:8693-8711). 

[0079] Plant gene expression must be operably linked to an appropriate promoter in order 
to confer gene expression in a timely, cell or tissue specific manner. Preferred are promoters 
driving constitutive expression (Benfey et al., 1989 EMBO J. 8:2195-2202) like those derived 
from plant viruses like the 35S CAMV (Franck et al., 1980 Cell 21:285-294), the 19S CaMV 
(see also U.S. Patent No. 5,352,605 and PCT Application No. WO 8402913) or plant 
promoters like those from Rubisco small subunit described in U.S. Patent No. 4,962,028. 

Especially preferred are those promoters that confer gene expression in specific 
plant tissues and organs, such as stamens and anthers. In this regard, a promoter which 
expresses during stamen development would be preferred as such a promoter is particularly 
appropriate to drive Rfo expression resulting in altered pollen production as desired. 
Examples of such promoters include the AP3 promoter,the Lat52 promoter (Twell, D. et al. 
(1989). Mol. Gen. Genet. 217, 240-248; Twell, D. et al. (1990). Development 109, 705-715.), 
the A9 promoter (Paul, W, et al, (1992). Plant Mol. Biol. 19, 611-622.), the fbpl promoter 
(Angenent, G.C. (1993). Plant J. 4, 101-112), the EPF2-5 promoter (Takatsuji, H. et al. 
(1994). Plant Cell 6, 947-958), and the p&4 promoter (Christensen, H.E. et al. (1996). Plant 
J. 10, 269-279). However, the utility of the present methods are not restricted with respect to 
the promoter. As will be appreciated by one of skill in the art, constitutive promoters and 
promoters which express during other stages of plant development, for example prior to 
stamen development, may also be useful in the present methods. 

[0080] Plant gene expression can also be facilitated via an inducible promoter (for review, 
see Gatz, 1997 Annu. Rev. Plant Physiol. Plant Mol. Biol. 48:89-108). Chemically inducible 
promoters are especially suitable if gene expression is wanted to occur in a time specific 
manner. Examples of such promoters are a salicylic acid inducible promoter (PCT 
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Application No. WO 95/19443), a tetracycline inducible promoter (Gatz et aL, 1992 Plant J. 
2:397-404) and an ethanol inducible promoter (PCT Application No. WO 93/21334). 
[0081] Other preferred sequences for use in plant gene expression cassettes are targeting- 
sequences necessary to direct the gene product in its appropriate cell compartment (for 
review see Kermode, 1996 Crit Rev. Plant Sci. 15(4):285-423 and references cited therein) 
such as the vacuole, the nucleus, all types of plastids like amyloplasts, chloroplasts, 
chromoplasts, the extracellular space, mitochondria, the endoplasmic reticulum, oil bodies, 
peroxisomes and other compartments of plant cells. 

[0092] In addition to providing a recombinant expression vector comprising a Rfo DNA 
molecule of the invention cloned into the expression vector in a sense orientation, the present 
invention provides such a vector wherein the Rfo DNA molecule is cloned into the vector in 
the antisense orientation. That is, the DNA molecule is operatively linked to a regulatory 
sequence in a manner that allows for expression (by transcription of the DNA molecule) of an 
RNA molecule that is antisense to a Rfo mRNA. Regulatory sequences operatively linked to 
a nucleic acid molecule cloned in the antisense orientation can be chosen which direct the 
continuous expression of the antisense RNA molecule in a variety of cell types. For instance, 
viral promoters and/or enhancers, or regulatory sequences can be chosen which direct 
constitutive, tissue specific or cell type specific expression of antisense RNA. The antisense 
expression vector can be in the form of a recombinant plasmid, phagemid or attenuated virus 
wherein antisense nucleic acids are produced under the control of a high efficiency regulatory 
region. The activity of the regulatory region can be determined by the cell type into which 
the vector is introduced. For a discussion of the regulation of gene expression using antisense 
genes see Weintraub, H. et al., Antisense RNA as a molecular tool for genetic analysis, 
Reviews - Trends in Genetics, Vol. 1(1) 1986 and Mol et al., 1990 FEBS Letters 268:427- 
430. The term antisense RNA is intended to also cover double stranded interfering RNAs 
(RNAi), which induce selective degradation of the RNAs complementary to one of the two 
dsRNA strands. 

[0093] Another aspect of the invention pertains to host cells into which a recombinant 
expression vector of the invention has been introduced. The terms "host cell" and 
"recombinant host cell" are used interchangeably herein. It is understood that such terms 
refer not only to the particular subject cell but they also apply to the progeny or potential 
progeny of such a cell. Because certain modifications may occur in succeeding generations 
due to either mutation or environmental influences, such progeny may not, in fact, be 
identical to the parent cell, but are still included within the scope of the term as used herein. 
A host cell can be any prokaryotic or eukaryotic cell. For example, a Rfo can be expressed in 
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bacterial cells such as C glutamicum, insect cells, fungal cells or mammalian cells (such as 
Chinese hamster ovary cells (CHO) or COS cells), algae, ciliates, plant cells, fungi or other 
microorganisms like C. glutamicum. In a preferred embodiment, the host cell is a plant cell, 
more preferably, a Brassica napus plant cell, and most preferably, a stamen or anther cell. 
[0094] Vector DNA can be introduced into prokaryotic or eukaryotic cells via 
conventional transformation or transfection techniques. As used herein, the terms 
'tansformation", *transfection", "conjugation" and "transduction" are intended to refer to a 
variety of art-recognized techniques for introducing foreign nucleic acid (e.g., DNA) into a 
host cell, including calcium phosphate or calcium chloride co-precipitation, DEAE-dextran- 
mediated transfection, lipofection, natural competence, chemical-mediated transfer and 
electroporation. Suitable methods for transforming or transfecting host cells including plant 
cells can be found in Sambrook, et al. (Molecular Cloning: A Laboratory Manual. 2 nd , ed., 
Cold Spring Harbor Laboratory, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 
NY, 1989) and other laboratory manuals such as Methods in Molecular Biology, 1995, Vol. 
44, Agrobacterium protocols, ed: Gartland and Davey, Humana Press, Totowa, New Jersey. 
[0095] In particular, the invention provides a method of producing a transgenic plant with 
a Rfo coding nucleic acid, wherein the plant is a cytoplasmic male-sterile plant and wherein 
expression of the nucleic acid in the plant results in restoration of fertility of the plant 
comprising: (a) transforming a plant cell with an expression vector comprising a Rfo nucleic 
acid, and (b) generating from the plant cell a transgenic plant. The plant cell includes, but is 
not limited to, a gamete producing cell, a protoplast and any other cell that regenerates into a 
whole plant. 

[0096] For such plant transformation, binary vectors such as pBinAR can be used 
(H6fgen and Willmitzer, 1990 Plant Science 66:221-230). Construction of the binary vectors 
can be performed by ligation of the cDNA in sense or antisense orientation into the T-DNA. 
5-prime to the cDNA a plant promoter activates transcription of the cDNA. A 
polyadenylation sequence is located 3-prime to the cDNA. Tissue-specific expression can be 
achieved by using a tissue specific promoter. For constitutive expression within the whole 
plant, the CaMV 35S promoter can be used. The expressed protein can be targeted to a 
cellular compartment using a signal peptide, for example for plastids, mitochondria or 
endoplasmic reticulum (Kennode, 1996 Crit. Rev. Plant Sci. 4 (15):285-423). The signal 
peptide is cloned 5-prime in frame to the cDNA to archive subcellular localization of the 
fusion protein. One skilled in the art will recognize that the promoter used should be 
operatively linked to the nucleic acid such that the promoter causes transcription of the 
nucleic acid and results in the synthesis of a mRNA which encodes a polypeptide. 
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Alternatively, the RNA can be an antisense RNA for use in affecting subsequent expression 
of the same or another gene or genes. 

[0097] Alternate methods of transfection include the direct transfer of DNA into 
developing flowers via electroporation or Agrobacterium mediated gene transfer. 
Agrobacterium mediated plant transformation can be performed using for example the 
GV3101(pMP90) (Koncz and Schell, 1986 Mol. Gen. Genet. 204:383-396) or LBA4404 
(Clontech) Agrobacterium tumefaciens strain. Transformation can be performed by standard 
transformation and regeneration techniques (Deblaere et al., 1994 Nucl. Acids. Res. 13:4777- 
4788; Gelvin, Stanton B. and Schilperoort, Robert A, Plant Molecular Biology Manual, 2 nd 
Ed. - Dordrecht : Kluwer Academic Publ, 1995. - in Sect., Ringbuc Zentrale Signatur: BT11- 
P ISBN 0-7923-2731-4; Glick, Bernard R; Thompson, John E., Methods in Plant Molecular 
Biology and Biotechnology, Boca Raton : CRC Press, 1993 360 S., ISBN 0-8493-5164-2). 

[0098] A host cell of the invention, such as a prokaryotic or eukaryotic host cell in 
culture, can be used to produce (i.e., express) a Rfo. Accordingly, the invention further 
provides methods for producing Rfo proteins using the host cells of the invention. In one 
embodiment, the method comprises culturing the host cell of invention (into which a 
recombinant expression vector encoding a Rfo protein has been introduced, or into which 
genome has been introduced a gene encoding a wild-type or altered Rfo protein) in a suitable 
medium until Rfo protein is produced. In another embodiment, the method further comprises 
isolating Rfo proteins from the medium or the host cell. 

[0099] Another aspect of the invention pertains to isolated Rfo proteins, and biologically 
active portions thereof. An "isolated" or "purified" protein or biologically active portion 
thereof is free of some of the cellular material when produced by recombinant DNA 
techniques, or chemical precursors or other chemicals when chemically synthesized. The 
language "substantially free of cellular material" includes preparations of Rfo protein in 
which the protein is separated from some of the cellular components of the cells in which it is 
naturally or recombinant^ produced. In one embodiment, the language "substantially free of 
cellular material" includes preparations of a Rfo protein having less than about 30% (by dry 
weight) of non-Rfo protein material (also referred to herein as a "contaminating protein"), 
more preferably less than about 20% of non-Rfo protein material, still more preferably less 
than about 10% of non-Rfo protein material, and most preferably less than about 5% non-Rfo 
protein material. 

[00100] When the Rfo protein or biologically active portion thereof is recombinantly 
produced, it is also preferably substantially free of culture medium, i.e., culture medium 
represents less than about 20%, more preferably less than about 10%, and most preferably 
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less than about 5% of the volume of the protein preparation. The language "substantially free 
of chemical precursors or other chemicals" includes preparations of Rfo protein in which the 
protein is separated from chemical precursors or other chemicals that are involved in the 
synthesis of the protein. In one embodiment, the language "substantially free of chemical 
precursors or other chemicals" includes preparations of a Rfo protein having less than about 
30% (by dry weight) of chemical precursors or non-Rfo protein chemicals, more preferably 
less than about 20% chemical precursors or non-Rfo protein chemicals, still more preferably 
less than about 10% chemical precursors or non-Rfo protein chemicals, and most preferably 
less than about 5% chemical precursors or non-Rfo protein chemicals. In preferred 
embodiments, isolated proteins, or biologically active portions thereof, lack contaminating 
proteins from the same organism from which the Rfo protein is derived. 
[00101] The present invention also provides antibodies that specifically bind to a Rfo 
protein, or a portion thereof, as encoded by a nucleic acid described herein. Antibodies can 
be made by many well-known methods (See, e.g. Harlow and Lane, "Antibodies; A 
Laboratory Manual" Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 
(1988)). Briefly, purified antigen can be injected into an animal in an amount and in intervals 
sufficient to elicit an immune response. Antibodies can either be purified directly, or spleen 
cells can be obtained from the animal. The cells can then fused with an immortal cell line 
and screened for antibody secretion. The antibodies can be used to screen nucleic acid clone 
libraries for cells secreting the antigen. Those positive clones can then be sequenced. (See, 
for example, Kelly et al., 1992 Bio/Technology 10:163-167; Bebbington . et al., 1992 
Bio/Technology 10:169-175). 

[00102] The phrases "selectively binds" and "specifically binds" when referring to binding 
to a polypeptide refer to a binding reaction that is determinative of the presence of the protein 
in a heterogeneous population of proteins and other biologies. Thus, under designated 
immunoassay conditions, the specified antibodies bound to a particular protein do not bind in 
a significant amount to other proteins present in the sample. Selective binding of an antibody 
under such conditions may require an antibody that is selected for its specificity for a 
particular protein. A variety of immunoassay formats may be used to select antibodies that 
selectively bind with a particular protein. For example, solid-phase ELISA immunoassays 
are routinely used to select antibodies selectively immunoreactive with a protein. See 
Harlow and Lane "Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New 
York, (1988), for a description of immunoassay formats and conditions that could be used to 
determine selective binding. 
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[00103] In some instances, it is desirable to prepare monoclonal antibodies fiom various 
hosts. A description of techniques for preparing such monoclonal antibodies may be found in 
Stites et aL, editors, s< Basic and Clinical Immunology," (Lange Medical Publications, Los 
Altos, Calif., Fourth Edition) and references cited therein, and in Harlow and Lane 
("Antibodies, A Laboratory Manual" Cold Spring Harbor Publications, New York, 1988). 
[00104] The nucleic acid molecules, proteins, protein homologs, fusion proteins, primers, 
vectors, and host cells described herein can be used in one or more of the following methods: 
producing a hybrid plant that contains the nucleic acid molecules of the present invention, 
modulating fertility in a plant, increasing production of viable pollen in a cytoplasmic male- 
sterile plant, restoring fertility in a cytoplasmic male-sterile plant, identifying Raphanus 
sativum and related organisms, mapping of genomes of organisms related to Raphanus 
sativum, identifying and localizing Raphanus sativum sequences of interest and peJ?/brming 
evolutionary studies. 

[00105] Accordingly, the present invention provides a method of producing a hybrid plant, 
comprising crossing a male-fertile plant containing the Rfo nucleic acids of the present 
invention with a male-sterile plant, collecting hybrid seed from the male-sterile plant and 
regenerating the hybrid plant from the seed. In a preferred embodiment, the male-sterile 
plant contains an ogu cytoplasmic male sterility determinant The present invention also 
provides a method of modulating the fertility of a transgenic plant that includes expressing a 
Rfo nucleic acid in the plant Preferably, the plant comprises a male-sterility determinant and 
expression of the nucleic acid sequence in the plant results in increased production of viable 
pollen by the plant. In a preferred embodiment, expression of the nucleic acid sequence in 
the plant results in restoration of fertility of the plant. In a more preferred embodiment, the 
plant contains an ogu cytoplasmic male sterility determinant. The present invention describes 
using the expression of Rfo of Raphanus sativum to restore fertility in male-sterile plants. 
The invention also provides a transgenic plant containing a Rfo nucleic acid or a fragment 
thereof, wherein the plant has increased fertility or viable pollen production as compared to a 
cytoplasmic male-sterile plant of the same variety. The transgenic plant can be a monocot or 
a dicot. The invention further provides that the transgenic plant can be selected from maize, 
wheat, rye, oat, triticale, rice, barley, soybean, peanut, rapeseed, canola, pepper, sunflower, 
Vicia species, pea, alfalfa, bushy plants (coffee, cacao, tea), Salix species, radish, sorghum, 
pearl millet, cotton, and tobacco, hi a preferred embodiment, the transgenic plant is a canola 
plant selected from the group consisting of Brassica napus, Brassica rapa (ot campestris), 
Brassica oleracea, Brassica nigra, Brassica juncea, Sinapis alba, and Brassica carinata. In 
a more preferred embodiment, the male-sterile plant is Brassica napus. 
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[00106] The present invention also allows for the production of a true breeding variety of 
plants that are capable of restoring male fertility in a Fl hybrid descendant of a plant of the 
present invention and a male-sterile plant. This type of true breeding variety of a fertility 
restorer plant is also termed a '"restorer line". The teams "variety" and "line" refer to a group 
of plants within a species that share constant characters that separate them from the typical 
form and from other possible varieties within that species. While possessing at least one 
distinctive trait, a variety or line is also characterized by some variation between individuals 
within the variety or line, based primarily on the Mendelian segregation of traits among the 
progeny of succeeding generations. A variety or line is considered "true breeding" for a 
particular trait if it is genetically homozygous for that trait to the extent that, when the true- 
breeding variety or line is self-pollinated, a significant amount of independent segregation of 
the trait among the progeny is not observed. In the present invention, the trait arises from the 
transgenic expression of a single DNA sequence introduced into a plant variety or plant line. 
[00107] In addition to introducing the Rfo nucleic acids into transgenic plants, these 
sequences can also be used to identify a plant as being Raphanus sativum or a close relative 
thereof. Also, they may be used to identify the presence of Raphanus sativum or a relative 
thereof in a mixed population of plants. The invention provides the nucleic acid sequences of 
a number of Raphanus sativum genes; by probing the extracted genomic DNA of a culture of 
a unique or mixed population of plants under stringent conditions with a probe spanning a 
region of a Raphanus sativum gene which is unique to this plant, one can ascertain whether 
this plant or the genus is present. 

[001081 More importantly, the Rfo nucleic acids can be used to isolate Rfo homologs in 
other species. The nucleotide sequences determined from the cloning of the Rfo genes from 
Raphanus sativum allow for the generation of probes and primers designed for use in 
identifying and/or cloning Rfo homologs in other cell types and plants such as Brassica 
napus 9 as well as Rfo homologs from other radishes and related species. 
[00109] Further, the nucleic acid and protein molecules of the invention may serve as 
markers for specific regions of the genome. This has utility not only in the mapping of the 
genome, but also in functional studies of Raphanus sativum proteins. For example, to 
identify the region of the genome to which a particular Raphanus sativum DNA-binding 
protein binds, the Raphanus sativum genome could be digested, and the fragments incubated 
with the DNA-binding protein. Those fragments that bind the protein may be additionally 
probed with the nucleic acid molecules of the invention, preferably with readily detectable 
labels. Binding of such a nucleic acid molecule to the genome fragment enables the 
localization of the fragment to the genome map of Raphanus sativum, and, when performed 
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multiple times with different enzymes, facilitates a rapid determination of the nucleic acid 
sequence to which the protein binds. Further, the nucleic acid molecules of the invention 
may be sufficiently homologous to the sequences of related species such that these nucleic 
acid molecules may serve as markers for the construction of a genomic map in related 
radishes. 

[00110] The nucleic acid and protein molecules of the invention may also be used as 
markers to identify regions of the jR/o-radish genome as being tightly genetically-linked to the 
Rfo gene. For example, DNA sequences given in SEQ ID NO:87 could be used as probes to 
identify restriction fragment length polymorphisms (Tanksley et al (1987) In: Chromosome 
structure and function. Plenum Press N.Y. ppl57-173) that are tightly genetically linked to 
the Rfo gene. These DNA markers could then be used to identify individuals in plant 
breeding programs that might have a minimal amount of radish DNA in the region flanking 
Rfo and which therefore would be highly useful for the development of effective, low 
glucosinolate B. napus restorer lines. Similarly, these sequences could be used to design 
PCR based DNA markers, such as SNPs (The International SNP Map Working Group (2001) 
Nature 409: 928-933) and SSRs (Tautz D (1989) Nucl Acids Res 17: 6463-6471) that could 
be used in simlar fashion. 

[00111] The nucleic acid molecules given in SEQ ID NO:87 that function as restorer genes 
could also be used as selection markers to identify transformed plant cells. For example, a 
sequence that functions as a restorer gene could be combined with another gene of interest in 
a transformation vector. The vector could then be introduced into plant cells by any of a 
number of methods such as Agrobacterium-mediated transformation. If the plant cells into 
which the DNA is introduced are cytoplasmically male sterile, the anthers that form on the 
regenerated plants will normally not produce any pollen and no seed will form from these 
flowers by self-pollination. If, however, the anthers form from cells that have acquired a 
restorer gene and the associated gene of interest, the anthers will produce pollen and these 
flowers will form seeds. This provides a powerful selection system for the identification of 
transformed plants or parts of plants. The use of restorer genes, which are purely normal 
plant genes, as selection markers may have some advantages over other currently used 
selection markers with respect to regulatory issues. The use of restorer genes as selection 
markers is not intended to be limited to the ogu CMS system and Rfo, but rather could apply 
to any restorer gene for any CMS system in any plant species. 

[00112] The Rfo nucleic acid molecules of the invention are also useful for evolutionary 
and protein structural studies. By comparing the sequences of the nucleic acid molecules of 
the present invention to those encoding similar proteins from other organisms, the 
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evolutionary relatedness of the organisms can be assessed. Similarly, such a comparison 
permits an assessment of which regions of the sequence are conserved and which are not, 
which may aid in detemiining those regions of the protein that are essential for the 
functioning of the Rfo proteins. This type of determination is of value for protein 
engineering studies and may give an indication of what the protein can tolerate in terms of 
mutagenesis without losing function. 

[00113] Throughout this application, various publications are referenced. The disclosures 
of all of these publications and those references cited within those publications in their 
entireties are hereby incorporated by reference into this application in order to more fully 
describe the state of the art to which this invention pertains. It should also be understood that 
the foregoing relates to preferred embodiments of the present invention and that numerous 
changes may be made therein without departing from the scope of the invention. The 
invention is further illustrated by the following examples, which are not to be construed in 
any way as imposing limitations upon the scope thereof. On the contrary, it is to be clearly 
understood that resort may be had to various other embodiments, modifications, and 
equivalents thereof, which, after reading the description herein, may suggest themselves to 
those skilled in the art without departing from the spirit of the present invention and/or the 
scope of the appended claims. 

EXAMPLES 

Materials and Methods 

Plant growth, DNA isolation, marker analyses and library construction 
[00114] Radish plants were grown in growth chambers with a 16 hour photoperiod cycle, 
at 20°C day and 15°C night. To self-fertilize these plants, a bud that was about to open was 
teezed apart and its anthers and petals removed. An anther from another flower on the same 
plant was picked and used for pollination. The pollen was applied directly to the stigma. This 
procedure was repeated on 3 or 4 buds of the same branch. Any nearby open flowers were 
detached. A white paper crossing bag was used to cover the buds. The plants were then 
placed in a greenhouse to allow for seed maturation. Fresh leaf material was used for DNA 
extraction. DNA extraction, restriction enzyme digestion, Southern blotting and RFLP 
analysis were carried according to Cheung WY et al (1997, Theor Appl Genet 95:73-82). 
Small scale DNA preparations suitable for PCR analysis were according to Cheung WY, 
Hubert N, Landry BS (1993). PCR Methods Applic 3:69-70. 

[00115] Genomic DNA from radish plants homozygous for the Rfo restorer locus was 
isolated, and BAC library construction and preparation of probes from BAC ends was 
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according to Woo SS, Jiang J, Gill BS, Paterson AH, Wing RA (1994) Nucleic Acids Res 22: 
4922-4931. High density colony filter of BAC clones were generated and screened according 
to the procedure of Clemson University Genomic Institute (Tomlrins JP, Mahalingham R, 
Smith H, Goicoechea JL, Knap HT, Wing R (1999) Plant Mol Biol 41: 25-32). A genomic 
library from a radish plant homozygous for the restorer was constructed in the fosmid vector 
pFOSl (New England Biolabs) according to the manufacturer's instructions. The library was 
screened by successive rounds of colony lifts as described in Sambrook et al (1989) 
Molecular Cloning: A laboratory Manual. 2 nd ed. Cold Spring Harbor laboratory Press, Cold 
Spring Harbor, NY. 

BAC and fosmid sub-cloning and sequencing 

[00116] BAC and fosmid DNA was isolated from exponentially growing E. coli 
(Electromax DH10B, Gibco) cells using a Qiagen Large-Construct Preparation Kit. The DNA 
was sheared to a size of approximately two to six kb with a Hydroshear (GeneMachines) 
according to manufacturer's recommendations. The ends of the sheared fragments were 
blunted using Gibco T4 DNA polymerase. The pBluescriptn phagemid (Stratagene) was used 
as a vector to sub-clone the blunted sheared DNA fragments. To prepare for ligation, the 
vector was digested with the restriction enzyme EcoRV (Amersham-Pharmacia Biotech) and 
dephosphorylated with Calf Intestine Phosphatase (CIP, New England Biolabs). The vector 
was then separated on a 0.7% agarose gel, excised and gel purified using the Sephaglas Kit 
(Amersham-Pharmacia Biotech). This digested, dephosphorylated and gel purified vector was 
ligated to the blunted sheared DNA fragments using the T4 DNA ligase (Gibco). After an 
overnight incubation at 16 deg. C, the ligation mixture was chemically transformed into 
Ecoli (XLl-Blue, Stratagene) as described in The NEB Transcript 6: (1) 7. Recombinants 
plasmid DNA was isolated with an Autogen 740, restricted with BamHl-Xhol (Amersham- 
Pharmacia Biotech) enzymes and separated on 0.8% agarose gels to determine insert size. 
Isolated plasmids were also sequenced with Li-COR DNA sequencers LONG-READIR 4200 
according to manufacturer's recommendations. 

Assembly of BAC and fosmid sequences 

[00117] Raw trace files were obtained in SCF format from one of ten Li-COR sequencers. 
Read lengths of approximately 800bp were obtained for each of the forward and reverse 
strands from a particular clone. The trace files were collected in a single directory on a 
UNIX server. The Staden software package was installed and configured on a Sun 420R 
server. 
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[00118] Pregap4 was used to process the SCF files and generate EXP files, which were 
assembled into a contiguous sequence. Phred was used to produce a quality score for each 
base within each read The raw sequence was clipped according to a minimum quality score 
of 15 as produced through the use of Phred Cross match was used to screen out any 
contaminating cloning vector sequence. Similarly Cross Match was used to mask any 
sequencing vector found in the raw sequence. RepeatMasker was used to mask any region 
that contained repetitive sequence, which may skew the assembly process. Once processed, 
the resulting EXP files were assembled using the Phrap program. Assembly was performed 
with a minimum exact match value of 12 and a minimum SWAT score of 30. 
[00119J The resulting assembly produced a preliminary sequence assembly, which was 
analyzed with the Gap4 software package. A minimum of five-fold coverage was deemed to 
be acceptable in considering a particular region of sequence completed and correct. The 
assembly was meticulously analyzed for incorrect and erroneous sequence position manually. 
Chimeric clones were discovered and removed. These were visible using the. known insert 
size of a particular clone and comparing the position of the reverse and forward reads in the 
assembly. Primers were designed using the Oligo 6 software (Molecular Biology Insights) in 
regions that contained potential mismatches or discrepancies. PCR amplifications were 
perforrmed, amplification products were cloned and the resulting sequences were assembled 
and the correctness of the sequence determined. 

[00120] Further evidence of sequence correctness was provided by restriction enzyme 
profiles of the individual BAC and fosmid clones and the corresponding consensus sequence. 
The use Gap4 Software and the recognition sites for EcdBl, EcoRV and HindUL produced an 
expected pattern of size fragments. These were then compared to restriction analysis 
performed in the laboratory, which concurred the correct sequence. 

Sub-cloning and plant transformation 

[00121] Constructs containing the Rfo genes of interest are introduced into Brassica napus 
as described below. B. napus seed are sterilized in 20% commercial bleach for 30 minutes 
with shaking. The seeds are then washed 5 times in a sterile funnel and filter paper with 
sterile distilled water. The seeds (-20 per plate) are placed on Bn I (seed germination 
medium, see below), wrapped in Parafilm and incubated at 25°C for 5 days. Agrobacteriwn 
with the plasmid of interest is inoculated into 5 ml LB + Kanamycin 100 (100 mg/L) + 
Gentamycin 25 ( 25mg/L) ( for clones in pRD400 vector) or 5 ml LB + tetracyclic (2.5 mg/L) 
+ Gentamycin. 25 (25 mg/L) ( for clones in pOCA18 vector)and incubated at 28°C with 
shaking for 24 hours. 50 |iL of this culture is transferred to 5 mL of LB with the appropriate 
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antibiotics as decribed above and incubated at 28°C with shaking for 24 hours. 5 mL of this 
culture is removed and recovered by centrifiigation. The cell pellet is resuspended in 1 mL of 
MS media (no antibiotic), diluted 1:100 in MS media and used for plant transformation. 
[00122] For transformation, the petiole of 4 to 5 day old Brassica cotyledons are cut with a 
sterile scalpel The cut surface is dipped in the diluted Agrobacterium suspension for 1 
second and the cotyledons on are placed on Bn II (co-cultivation) medium by pushing the 
petiole into the soft agar. 20 cotyledons are placed on each plate. The plates are wrapped in 
Parafilm and incubated for 3 days at 25°C in a growth room under constant illumination. 
After 2 days, the cotyledons are transferred immediately to Bn IV (selection regeneration) 
medium if Agrobacterium growth around the petiole is noted. Otherwise, the cotyledons are 
transferred to Bn IV (selection regeneration) medium after 3 days. Upon transfer, the plates 
are sealed with Parafilm and incubated at 25°C under constant illumination. Cotyledons are 
transferred to fresh plates after 7 days and again every 10 days after that. At this stage a 
maximum of 10 cotyledons are placed on each plate. Callus formation at the tip of the petiole 
occurs within several weeks. 

[00123] At this time, bleached shoots are removed. Putative transgenic shoots that arise 
from the callus are green and may look vitrified. These are excised and transferred to Bn V 
shoot elongation medium. Attached calli are removed from the shoot. In Bn V media some 
shoots will develop roots at which point the plantlets can be transferred to soil. After several 
weeks on Bn V media, if the shoots have not developed roots, they are transferred to Bn VI 
(rooting) media. Once roots have formed, the plantlets can be transferred to soil. If rooting 
seems not to occur after several transfers, the plantlets may be put directly into soil. 
[00124] If the B. napus seed used in the transformation procedure above was an ogu CMS 
line, the recovery of male fertile plants is indicative of the presence of the Rfo gene in the 
introduced DNA. The transgenic plant could also be screened for an ability to restore male- 
fertility by crossing the transgenic plant with a male-sterile plant and determining if the Fl 
generation hybrid is male-fertile. A determination as to whether male-fertility is restored in a 
plant can be made, for example, by 1) visually assessing an increase in the production of 
pollen as compared to a male-sterile plant or 2) determining that the plant can self-fertilize as 
evidenced by placing a bag over a flower on the plant and finding an increase of seed therein 
as compared to a male-sterile plant. 
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Media 

Bn I: Murashige-Skoog (MS) minimal medium (Sigma), 3% sucrose, pH 5 .8 
Bn II: MS, 3% sucrose, 4.5 mg/L Benzyladenine (BA), pH 5.8 
Bn IV: MS, 3% sucrose, 4.5 mg/L BA, 20 mg/L Kanamycin (Km), 500 mg/L 
300mg/L Timentin (Tn), pH 5.8 

Bn V: MS, 3% sucrose, 20 mg/L Km, 300 mg/L Tn, pH 5.8 Bn VI: MS, 3% 

sucrose, 2 mg/L Indole butyric acid (IBA), 300 mg/L Tn, pH 5.8. 

All of the media contain 0.7% w / v phytagar. 

LB: lOg/L Tryptone, 5g/L yeast extract, lOg/L NaCl, pH 7.0 

Analysis of transgenic plants 

[00125] Individual transgenic plants were raised to maturity and visually assessed for male 
fertility/sterility. Small samples of leaf DNA were taken from the plants and analyzed by 
PCR using primers for specific genes and for the NPT1I plant selection maker in the vectors. 
Gene 16 and Gene 26 transgenic plants were further analyzed for the vector copy number and 
integrity by Southern blot analysis. Only plants testing positive for the NPTII, introduced 
gene and orfl38 are listed in Table 1. 

RACE Analysis 

[00126] Total RNA from fertile radish plants and fertile Brassica transgenic plants were 
extracted at flowering stage. The Gene specific primers were designed from the predicted 
exons of Genes 16 and Gene 26. A SMART™ RACE cDNA amplification kit (Clontech) was 
used to generate 5' and 3* RACE products. The PCR products were purified and sequenced 
directly. 

Scientific approach 

[00127J A strategy termed positional or map-based cloning, that allows the identification 
and isolation of genes solely on the basis of the phenotype they confer, was employed to 
isolate these genes. This strategy is based on the principle of genetic mapping. Populations 
are developed in which the gene of interest, in this case the Rfo restorer gene, is segregating 
genetically. In the present example, mapping populations were developed by crossing a 
radish restorer line (a line that is homozygous for the Rfo locus) with a radish Ogura CMS 
line lacking the restorer gene (or homozygous for the recessive Rfo allele of the restorer 
locus). The resulting Fl generation plants were then self-fertilized to create F2 populations. 
Individual F2 plants heterozygous for the restorer gene were self-fertilized to create F3 
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populations. F4 and F5 populations were developed by self-fertilizing F3 and F4 plants, 
respectively. 

[00128] Markers that detect differences in the original parent plants are used to track the 
chromosomal segments from each parent that are transmitted to the different progeny plants. 
In the current case, markers employed were DNA markers, which directly detect DNA 
sequence differences between the parental chromosomes. The restorer gene itself can be 
tracked by its phenotype; plants with the ogu male sterile cytoplasm that lack the restorer 
gene are male sterile (Fig. IB) and possess small sterile stamens that are clearly 
distinguishable from those of male fertile plants with the ogu cytoplasm that contain the 
restorer gene (Fig. 1 A). Rfo is a dominant gene, meaning that a plant need inherit only one 
copy of the Rfo locus to be male fertile; plants heterozygous for Rfo are indistinguishable 
from plants homozygous for Rfo. Plants lacking a copy of Rfo (or that are homozygous for 
the Rfo recessive allele) with the male sterile cytoplasm are sterile. Chromosomal maps can 
be generated that are based on the frequency with which the markers from a particular parent 
are co-inherited in progeny plants: the more frequently two markers are inherited together, 
the closer they are located to one another on the chromosome. 

[001291 For markers very close to the restorer gene, at least one copy of the allele from the 
fertile parent will almost always be found in fertile plants. It is possible to genetically delimit 
the chromosomal region containing the restorer gene on this basis: if the corresponding 
segments of two homologous chromatids of a fertile plant each carry alleles from the sterile 
parent, that segment cannot contain the restorer gene. Similarly, if a chromosome segment 
in a sterile plant carries alleles of the fertile parent, that segment cannot contain the restorer 
gene. 

[00130] The availability of DNA markers that are genetically tightly linked to a gene of 
interest is the starting point for a map-based cloning strategy. These markers are used to 
isolate genomic DNA clones, which may, in turn be used to select additional genomic clones. 
The individual clones so isolated are grouped into "contigs", sets of overlapping cloned DNA 
segments, until a single contig that spans the genomic region surrounding the gene of interest 
is assembled. By introducing the different portions of the cloned region or contig into plants 
of the appropriate genotype, and subsequently assessing the phenotype of these plants, it is 
possible to identify the specific portion of the contig that contains the gene. To characterize 
the gene, the sequence and expression of this specific portion is analyzed. 
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Genetic localization of the radish Rfo locus 

[00131] Rfo mapping populations were developed using both Asian and European radish 
varieties. Initial studies indicated that B. napus RFLP markers that co-segregated with Rfo in 
5. napus crosses mapped up to 60 cM from one another in radish, consistent with the genetic 
distance separating these markers in B. napus crosses that do not involve Rfo (Delourme R, 
Foisset N, Horcais R, Barret P, Champagne G, Cheung W, Landry BS & Renard M (1998) 
Theor Appl Genet 97: 129-134). These mapping studies were extended by developing F3, F4 
and F5 populations. One of the F4 populations (D26/44 F4) comprised 90 individuals and 
became the focus of subsequent analyses aimed at more extensive targeting and mapping of 
markers to the Rfo region. New jR/o-linked RFLP markers, identified by bulked segregant 
analysis (Michelmore RW. ParanL KesseliRV (1991) Proc Natl Acad Sci USA. 88:9828- 
9832.), were mapped on this F4 population. One marker, DLM82, was found to co-segregate 
with Rfo, while two others, DLM 299 and DLM316, were found to map within 4.6 and 9.2 
cM of Rfo, respectively. Recombination between the DLM 82 marker and Rfo was detected 
through analysis of a second radish F4 population (D63/41 F4) of 89 individuals. The results 
from these mapping studies, summarized in Fig. 2, indicated that markers spanning the Rfo 
region had been identified. 

[00132] A radish plant homozygous for ^-associated alleles of all markers in the Rfo 
region was used as the source of genomic DNA to construct a BAC genomic library suitable 
for cloning the gene. A library of 48,000 clones with an average insert size of 43 kb was 
generated and arrayed in 384-well microtiter plates. This library represents the radish 
genome at approximately 4-fold redundancy. To exploit the regional synteny between radish 
and Arabidopsis, Brassica/radish RFLP markers mapping close to Rfo were first hybridized to 
an Arabidopsis BAC library to identify corresponding Arabidopsis clones; these were then 
selected and a contig of overlapping Arabidopsis BACs was constructed using a BAC 
fingerprinting database. Markers derived from these BACs as well as radish RFLP markers 
mapping close to Rfo were used as probes to identify corresponding radish BAC clones. End 
probes from these Arabidopsis and radish BACs were then used to identify additional, 
overlapping BAC clones, and thereby extend the different contigs. 

[00133] Clone DLM82 identified several BAC clones in the Arabidopsis library; end 
probes from one of these detected polymorphism between sterile and fertile radish bulks. 
Starting with this clone, it was possible to assemble an Arabidopsis contig of BACs, which 
represent the segment of the Arabidopsis genome extending from Chromosome 1 nucleotides 
23,391,584 to 23,806,826 (F22C12-F13011) (Arabidopsis sequence coordinates and BACs 
are listed on the Arabidopsis Genome Initiative website: 
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http://www.arabidopsis.org/agi.html). End probes from clones of this region detected 
polymorphism between the radish bulks, indicating that synteny between radish and 
Arabidopsis was maintained in this region. An Arabidopsis BAC derived end probe L12 was 
found to hybridize to two polymorphic radish fragments; these fragments represent two 
linked but independent loci, L12a and L12b. One of these, L12a, co-segregated with B. 
napus marker DLM 82. Two recombination events were found to have occurred in the F4 
mapping population between the L12a and L12b loci, with L12b being located distal to Rfo, 
between the B. napus RFLP markers DLM356 and DLM82. These results are summarized in 
Fig. 2. Probes derived from the ends of other Arabidopsis BACs were found to map within 
the interval defined by L12b and DLM82/L12a. Additional genetic mapping in radish of 
markers derived from this Arabidopsis contig, such as L40 (Fig. 3A) indicated that the 
physical location of the markers was consistent with their genetic position, and hence that co- 
linearity in this region between the Arabidopsis and radish genomes (radish regions rBl-rC 
and Arabidopsis B-C, Fig 3A) was maintained. 

[00134] One Arabidopsis marker located in the vicinity of L12, was found, like LI 2, to be 
duplicated in radish, suggesting that a portion of the syntenic Arabidopsis region is duplicated 
in the radish genomic region near Rfo (region rB2, Fig. 3 A). Further analysis of the radish 
population using markers derived from the Arabidopsis BACs indicated a lack of synteny 
between radish and Arabidopsis for markers located to the right of L40 and suggested an 
inversion of markers in the duplicated region (region rB2, Fig. 3A). This further suggested 
that the genomic region in radish containing Rfo might correspond to the Arabidopsis region 
flanking the L12 sequence but distal to the L40 sequence (region A, Fig. 3B). 
[00135] A chromosome walk from the opposite side of Rfo was initiated using the RFLP 
probe DLM299, which detects a small gene family in radish, only one member of which 
appears to be linked to Rfo. DLM299 was used to recover two radish BAC clones, from 
which derived probes detected differences between the bulked homozyogous fertile and 
sterile radish DNA, indicating linkage to Rfo. The sequences which comprise the DLM299 
region recovered in this manner were found to be located within two radish BACs. Probes 
derived from these BACs were used to recover an additional set of radish BACs. 
Interestingly, an end probe from one of these additional BACs (64K20) mapped closer to Rfo 
than did DLM299. 

[00136] Further localization of the Rfo locus was accomplished by developing and 
analyzing a European radish mapping population of 135 individuals, together with an F5 
Asian radish population of 900 individuals. A radish BAC recovered using sequences from 
Arabidopsis region rB2 (11K10) was sequenced and used to derive a CAPs marker 
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(Konieczny A, Ausubel FM (1993) Plant J 4: 403-410) to facilitate analysis of this large 
population. A CAPS marker was also derived using sequence information from BAC 64K10, 
which maps to the opposite side of Rfo. By screening small scale DNA preparations from all 
900 individuals using these markers it was possible to identify individuals in which 
recombination had occurred in the vicinity of Rfo and which were therefore informative for 
more detailed mapping analysis. In addition, the radish BAC contigs were extended by using 
probes derived from Arabidopsis region A (roughly Chromosome 1 nucleotides 23,000,000 
to 23,400,000 in the Arabidopsis genome; see Fig. 3B) to recover additional radish BACs. 
These, together with radish BACs corresponding to region rB2, were used to recover more 
radish BACs and all these were assembled into contigs on the basis of overlapping restriction 
enzyme profiles. Gaps between the contigs were filled by recovering clones from a cosmid 
library of genomic DNA isolated from a plant known to be homozygous for the Rfo region. 
The physical relationship among the clones from a portion of this contig is shown in Fig. 4. 
[001371 BAC and cosmid clones showing complete genetic linkage to Rfo were sequenced. 
In total, a sequence of over 270 kb, representing a minimum of 5-fold coverage of each 
genomic region, was assembled and annotated. The result of the sequence analysis of this Rfo 
region is provided in the Appendix as SEQ ID NO:87. The sequence was found to contain 43 
putative (or predicted) genes. Those 43 putative gene products and genes, and gene products, 
are provided in the Appendix herein as SEQ ID NOS:l-86. 

[00138] The European radish population segregating for Rfo was found to be 
monomoiphic for markers derived from the region containing genes 3 1 through 41 of the 270 
kb sequence. Since this population segregated for the restorer gene, this observation 
delimited one boundary of the potential Rfo coding region and eliminated predicted genes 31 
through 41 as candidates. This region was also eliminated by a sterile plant in the Asian F5 
population, which was homozygous for the sterile parent alleles for markers in the region 
througjh gene 30, but heterozygous for an allele derived from gene 31. The Asian radish F5 
population also contained a sterile plant that was homozygous for sterile parent alleles 
derived from the region extending from gene 16 through 30, but heterozygous for alleles 
from gene 9 through 15 (genes 1 through 9 were not examined). Accounting for the 
possibility of recombination within restriction fragments that define the RFLPs for these 
alleles, these observations indicated together that the Rfo locus resided within the region 
containing predicted genes 14 through 30. 
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Transformation of genes from the Rfo coding region into Ogura CMS B. napus 
[00139] Plant transformation experiments were conducted to determine the capacity of the 
various predicted genes to act as restorers of Ogura CMS in 5. napus. This test was based on 
the premise that transformation of CMS plants with genomic DNA spanning a restorer gene 
should result in the recovery of male fertile transgenic plants. The genes were sub-cloned 
from clones, individually or in combination with flanking genes, into binary transformation 
vectors, and introduced into Ogura CMS B. napus plants by Agrobacterium-mediated 
transformation. The genes in the Rfo containing region and the various subclones used in 
transformation experiments are shown in Fig. 5. The region roughly corresponds to the 
region extending from the left end of clone 50E19 to the right end of clone 12M3 in Fig. 4. 
[00140] The results of these transformation experiments are summarized in Table 1 . 
Table 1. Plant transformation results 

Recovered plants 



Construct 


Vector 


Genelsl 


Fertile 


Sterile 


P12-33 


pRD400 


14 


0 


1 


PI 1-92 


pOCA18 


14,15 


0 


1 


Pl-Sh20 


pRD400 


15 


0 


23 1 


P2-Sh31 


pRD400 


16 


2 


5 


P24-Sh23 


pRD400 


17 


0 


9 1 


PI 1-76 


pOCA18 


16,17,18,19 


0 


2 


PI 1-96 


pOCA18 


15, 16, 17, 


0 


1 






18, 






P3-Sh8 


pRD400 


18 


0 


2 


Pl-11 


pOCA18 


19 


0 


1 


PI 1-66 


pOCA18 


20 


0 


4 


A4-112 


pOCA18 


21,22 


0 


l 2 


PEP-3 


pRD400 


24 


0 


5 


Bgl-5 


pRD400 


26 


1 


0 


KH8 


pRD400 


28, 29 


0 


1 



] Two of these plants had a few branches with fertile flowers. 
2 Plant had some fertile and some sterile flowers. 
[Please take out the rows for A4-134 and KE7-3] 

[00141] Completely male fertile plants were recovered following transformation with 
clones P2-Sh31 and Bgl-5, which contain the predicted genes, Gene 16 and Gene 26, 
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respectively (Fig. 6). These male fertile plants showed no phenotypic alterations apart from 
the change in floral phenotype from Ogura CMS (left panel, Fig. 6) to the completely male 
fertile transgenically restored phenotype (right panel, Fig. 6). The transgenically restored 
flowers were indistinguishable from normal male fertile B. napus flowers. These 
observations indicate that Genes 16 and Gene 26 can each be used to derive new restorer 
lines from Ogura CMS lines that will be free from phenotypic abnormalities, including the 
high seed glucosinolate character. 

[00142] Partially male fertile plants in which one or more branches contained male fertile 
flowers were recovered in transformants with three other constructs. Interestingly, one of 
these constructs (pl-Sh20) contained a gene that showed a high degree of similarity with 
Gene 16 (Gene 15, see below), while another (P24-Sh23) overlapped with P2-Sh31 and 
contained a portion of the Gene 16 coding sequence (see below). Only completely male 
sterile plants were recovered with most of the introduced cloned DNAs. All the transgenic 
plants recovered in these experiments were analyzed by PCR using primers specific for the 
introduced gene or genes, the NPTII transformation marker, and the mitochondrial gene 
orfl38 9 which is unique to the Ogura male sterile cytoplasm. 

100143] Only plants that tested positive for all the transgenic sequences are listed in Table 
1. For the Gene 16 and Gene 26 plants, these PCR analyses were confirmed by Southern blot 
analysis. Southern blot analysis also indicated that the fertile Gene 16 plants had two 
different transgene insertion events, each of which contained one or two copies of the 
introduced genes. Southern analysis of the fertile Gene 26 plant indicated that it had one 
transgene insertion site. For all completely fertile transformants additional PCR tests 
confirmed that borders of the inserted sequence were intact and matched those in the vector. 
Male fertile flowers of the plants were capable of setting seed upon selfing, and hence were 
female, as well as male, fertile. 

[00144] The specificity of the orfl38 primers was examined by testing their capacity to 
amplify a product from normal male fertile B. napus plants. No such product was observed 
in plants without the Ogura cytoplasm. This shows that the male fertile character of the Gene 
16 and Gene 26 transformants, from which an orf!38 PCR product was obtained indicating 
the presence of the Ogura cytoplasm, is not due to an artifact arising from contaminating 
male fertile cytoplasm seeds in the seed lot used to generate the explants for the 
transformation experiments. To rule out the possibility that mtDNA reairagements involving 
only a portion of the mtDNA might be responsible for the male fertility of the Gene 16 
transformed plants, the floral mtDNAs of these plants were isolated and further analyzed by 
restriction enzyme digestion. The mtDNA restriction digestion pattern was identical to that 
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of the Ogura CMS parental plants used for the transformation experiments, (data not shown) 
indicating that no such rearrangement took place during the transformation/regeneration 
process. Collectively, the analysis of the plants transformed with different portions of the 
potential Rfo coding region indicates that at least two genes in this region are each capable of 
fully restoring fertility to Ogura CMS B. napus plants, Gene 16 and Gene 26, as defined 
herein. 

Characterization of Gene 16 

[00145] The annotation of the approximately 270 kb sequence (SEQ ID NO:87) obtained 
using the program GenScan revealed the presence, in the region contained on construct P2- 
Sh31, of a putative open reading frame starting at nucleotide 103,375 and extending to 
nucleotide 105,589, with a single intron extending from nucleotides 104,498 to 104,588. 
This Gene 16 ORF (SEQ ID NO:32) is predicted to encode a protein of 707 amino acids 
(SEQ ID NO:31). To confirm the gene prediction, 5' and 3* Rapid Amplification of cDNA 
Ends (RACE) analyses were performed on RNA isolated from radish plants homozygous for 
either the fertile or sterile alleles of genes throughout the entire Rfo region, including Gene 
16. These analyses allowed identification of the sequences present in a full length cDNA of 
Gene 16 and hence unambiguous assignment of intron locations and the 5 J and 3' mRNA 
ends. A single product was obtained from both the sterile and fertile allele plants with both 
5* and 3'RACE (Fig. 7). The 5' RACE products of the sterile and fertile alleles were similar 
in size, whereas the 3' RACE product from the sterile allele was slightly smaller than the 
corresponding product from fertile plants. 

[00146] DNA sequence analysis of the fertile allele products indicated that the 5' end of 
the mature Gene 16 RNA mapped at or around nucleotide 102,847, and that the 3' end 
mapped at nucleotide 105,837. Sequence analysis of RT-PCR products confirmed the 
presence of the predicted intron and also revealed the presence of an additional intron 
upstream of the start codon that extended from nucleotides 103,036 to nucleotide 103,353. 
The structure of the Gene 16 mRNA is illustrated in Fig. 8. This Figure also illustrates the 
overlap between the Gene 16 clone and the adjacent clone (P24-Sh23) used in transformation 
experiments that gave rise to two plants with some male fertile flowers. 
[00147] The predicted amino acid sequence of the Gene 16 polypeptide (Genel6p) product 
is identical to that provided in the Appendix (SEQ ID NO:31), since the ATG start codon lies 
downstream of the first, unpredicted intron, and is shown in Fig. 9. The polypeptide shows a 
relatively high degree of sequence similarity to the Gene 15 product (Fig. 10) (SEQ ID 
NO:29), and lower degrees of similarity with its Arabidopsis ortholog (F2K1L10) and the 
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ortholog of Gene 15 (F2K11.11), as well as two paraloguous sequences in the Arabidopsis 
genome (F14N23.29 and T30E16.23). The biological functions of the Arabidopsis genes and 
proteins are not known. The similarity with these putative plant proteins lies mainly towards 
the C terminus. 

[00148] Because the products of restorer genes are expected to act in the mitochondria, 
Genel6p for the presence of a potential mitochondrial transit peptide was analyzed (targeting 
presequence) using the program MitoProtn. This analysis indicated that the N terminal 
region of Gene 16 could function as a mitochondrial targeting site, but the probability 
associated with this was relatively low (approx, 20%). However, if translation started at a 
different methionine residue (residue 419) the N terminus of the resulting product would have 
a high probability of serving as a mitochondrial targeting presequence (95%). Other 
examples where the products of alternate translation inititiation events are targeted to 
different sub-cellular sites, including mitochondria, are known (Martin N and Hopper AK 
(1994) Biochimie 76: 1161-1167). It is also possible that the mitochondrial targeting 
information is located at an internal site in the Genel6p, since there are also examples of 
mitochondrial targeting sequences that are located in the interior of polypeptides instead of at 
the N terminus (Schricker R et al, (2002) J Biol Chem in press) . 

100149] Collectively these analyses indicate that Gene 16 encodes a protein that resembles 
a protein encoded by the adjacent Gene 15 on the radish genome and, to a lesser degree, a 
small family of proteins in the encoded by genes in the Arabidopsis genome. It should be 
noted that some fertile flowers were observed on two plants transformed with the Gene 15 
construct (Table 1), suggesting that this structural homolog can, to a more limited degree, 
functionally overlap with Gene 16 in its capacity to restore male fertility in Ogura CMS B. 
napus plants. 

[00150] A thirty-two amino acid domain was found twice near the N-terminus of gene 16 
and was found also once in genel 5. A domain with the consensus 
"GTPNLAAQGTT^xTPAxQxYPxMF" (SEQ ID NO:91) was found repeating tandemly nine 
times in gene 16 and seven times in gene 15 near the carboxy terminus of the respective gene 
products. A summary of the two types of repeat domains found in the products of gene 16 
and gene 15 with their consensus sequences and the locations of the repeats in the repective 
gene products is summarized in Table 2, 
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Table 2 



Domain consensus 


Gene 


Domain sequences 


Location 
(Amino acid #) 


PVSSE PxQxLGSTSDxSS 


16 


PVSSEPVQPLGSTSDESSGTETTPLAPPPVTT 
(SEQIDNO:93) 


39-70 


GTETTPLAPP xTT 
(SEQE)NO:92) 


16 


PVSSEQPAQALGSTSDQSSGTETTPLAPPITT 
(SEQE)NO:94) 


93-124 


15 


PVSSEPVORLGSTSDOCSGTHTTPLAPP 
(SEQIDNO:95) 


39-66 










GTPNLAAQGTT xTPAxQ 
xYPxMF(SEQIDNO:96) 


16 


GSPNLATYGTTAIPAVQAYAIMF 
(SEQ ID NO:97) 


494-516 


16 


GAPNFTSQGTTATPAFQAFPIMF 
(SEQ ID NO:98) 


517-539 


16 


GTPNIAAQGTTRAPAVQAYPTMF 
(SEQE)NO:99) 


540-562 


16 


GTPNIGVQGSTPAAQTYPLMF 
(SEQ ID NO: 100) 


. 563-583 


16 


GTPNLAAQGTTNIGARGTTPAAQAYPLMF 
(SEQIDNO:101) 


584-612 


16 


GTPNIAAQGTTTPAVQSYPTMF 
(SEQIDNO:102) 


613-624 


16 


GTPNLAGQSTTTTRAGQPYPTTF 
(SEOIDNO:103> 


625-647 




AVPQAATAPAVQPYAMMF 
(SEQ ID NO: 104) 


648-665 


16 


GTPSLGAQDITPGGQAYPA 
(SEQ ID NO: 105) 


666-686 


15 


ATPNLAAYGTTPAVQAYPMMF 
(SEQ ID NO: 106) 


431-451 


15 


GIPNLAAQGTATPSVQAYPMIF 
(SEQ ID NO: 107) 


452^73 


15 


GIPNLAAQGTTATPAFQAYPMIF 
(SEQ ID NO: 108) 


474-496 


15 


GIPNVAAQGTTTTTPAAQAYPMMF 
(SEQ ID NO: 109) 


497-520 


15 


GIPNLAAQGnTPAAQPYPTMF 
(SEQ ID NO: 110) 


521-542 


15 


GTPSLAAQGTTTAPAVQPYPTMY 
(SEQ ID NO: 111) 


543-565 


15 


GTPNFVAQGMTPAAQAYPVNG 
(SEQ ID NO: 112) 


566-586 



X indicates possible substitutions of various amino acids and _ indicates site of possible 
insertion of one or multiple amino acids. 



Characterization of Gene 26 

[00151] The GenScan annotation indicated the presence of a putative open reading frame 
on the strand complementary to that in the Appendix (SEQ ID NO:87) mat extends from 
nucleotides 173,669 to 167,079, with introns extending from nucleotides 173,613 to 171,029; 
170,894 to 169,686; and 167,581 to 167,198. This region is contained within the Bgl-5 
construct. The Gene 26 OKF is predicted to encode a protein of 804 amino acids (SEQ ID 
NO:51). To confirm the gene prediction, 5' and 3' RACE analyses were performed on RNA 
isolated from radish plants homozygous for either the fertile or sterile alleles of genes 
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throughout the entire Rfo region, including Gene 26. A single product was obtained from 
fertile allele plant RNA using both 5' and 3' RACE (Fig. 11); neither a 5' nor a 3' RACE 
product could be amplified from sterile allele plant RNA (not shown). 
[00152] The structure of gene 26 mRNA as deduced from the analysis of RT-PCR 
products is illustrated in Fig. 12. Sequence analysis of 5' RACE product indicated that the 5' 
of the Gene 26 mRNA mapped near nucleotide 171,000 and that the 3' end mapped at or near 
nucleotide 167,000. The mature mRNA lacks the second intron sequence indicating this 
sequence is removed by RNA splicing. The sequence of the third predicted intron was found 
within the mRNA. As a result, the mRNA encoded peptide is 707 instead of 804 amino acids 
in length. Fig. 13 depicts the polypeptide (SEQ ID NO: 88) encoded by the newly defined 
Gene 26 ORF (SEQ ID NO:89). The N-terminus of Gene 26p is predicted to function as a 
mitochondrial targeting presequence with a likelihood of over 98% that Gene 26p is directed 
to the mitochondria Gene 26 contains 15 repeats of a 35 amino acid consensus sequence, 
the pentatricopeptide or PPR motif VT YNSLIS GYCKAGKLEE ALELFKEMKEKGIKPD V 
(SEQ ID NO:90) (Small ID & Peters N (2000) Trends Biochem Sci 25: 46-47). The 
sequences of the repeated domain (PPR (pentatricopeptide repeats) and their locations in gene 
26 product are summarized in Table 3. 



Table 3 



Domain 
consensus 


Gene 


Domain sequences 


Location (Amino 
acid#) 


VTYNSLISG 
YCKAGKLEE 


26 


YSFMLIKCFCSCSKIJFAI^TFGKITKLGLHPDV 
(SEQIDNO:114) 


115-149 


ALELFKEMK 
EKGKPDV 


26 


V'lFilLiilGLCVEDRVSKALDFFHQM^'E'lTCJKPNV 
(SEQIDNO:115) 


150484 


(SEQ ID 
NO: 113) 


26 


VTFITLMNGLCREGRIVEAVALIJ)RMMEDGLQPT 
(SEQE>NO:116) 


185-218 




26 


ITYGTIVDGMCKKGDTVSALNLLRKMEEVSHIIP 
(SEQIDNO:117) 


220-255 




26 


VTYSAIIDSLCKDGRHSDAQNLFTEMQEKGIFPD 
(SEQIDNO:118) 


256-289 




26 


FTYNSMIVGFCSSGRWSDAEQLLQEMLERKISPDV 
(SEQIDNO:119) 


291-325 




26 


VTYNALINAFVKEGKFFEAEELYDEMLPRGIIPNT 
(SEQ ID NO: 120) 


326-360 




26 


ITYSSMIDGFCKQNRLDAAEHMFYLMATKGCSPN 
(SEQIDNO:121) 


361-394 




26 


ITFNTLIDGYCGAKRIDDGMELLHEMTETGLVADT 
(SEQ ID NO: 122) 


396-430 




26 


TYNTLIHGFYLVGDLNAALDLLQEMISSGLCPD 
(SEQIDNO:123) 


432-464 




26 


VTCDTLLDGL(^NGKIJCDALEMFKVMQKSKKDLDAS 

HPFNGVEPDV 

(SEQIDNO:124^ 


466-511 




26 


TYNILISGLINEGKFLEAEELYEEMPHRGIVPDT 
(SEQIDNO:125^ 


513-546 
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26 


1TYSSMTOGLCKQSRLDEATQMFDSMGSKSFSPNV 
(SEQIDNO:126) 


547-581 


26 


VTFTTLINGYCKAGRVDDGLELFCEMGRRGIVAN 
(SEQIDNO:127) 


582-615 


26 


ITYITUCGFRKVGNINGALDIFQEMISSGVYPDT 
(SEQ ID NO: 128) 


617-651 



[00153] The PPR motif is found in other nuclear-encoded regulators of organelle gene 
expression in both fungi and plants (Fisk DG et al (1999) EMBO J 18: 2621-2630; Coffin JW 
et al (1997) Curr. Genet 32:273-280; Manthey GM and McEwen JE (1995) EMBO J. 14, 
4031-40) and PPR protein-encoding genes represent a large gene family in the Arabidopsis 
genome (Aubourg et al (2000) Plant Mol Biol 42: 603-613). The predicted mitochondrial 
localization of Gene 26p and the presence of multiple PPR domains within it are consistent 
with its role in regulating the expression of the orfl38 ORF associated with Ogura CMS. 
[00154] The above example provides evidence that multiple genes within the sequence 
listed in the Appendix as SEQ ID NO:87 can function as fertility restorer genes. These 
experiments, however, do not exclude the possibility that other genes within the region can 
also function as restorer genes. For example, two partially fertile plants were obtained after 
transformation with the Gene 15 (SEQ ID NO:30) construct. Thus Gene 15, like Gene 16, 
may be able to function as a restorer gene. Similarly, two partially fertile plants were 
recovered after transformation with the Gene 17 (SEQ ID NO:34) construct, although in this 
case, the restoring factor may have been derived from expression of the portion of Gene 16 
included in the Gene 17 transformation construct. A plant transformed with a construct 
containing Gene 21 (SEQ ID NO:42) and Gene 22 (SEQ ID NO:44) also produced some 
fertile flowers. Gene 24 (SEQ ID NO:48), Gene 26 (SEQ ID NO:52) and Gene 27 (SEQ ID 
NO:54) also contain PPR domains and are therefore expected to restore fertility phenotype 
according to the present invention. As discussed above, the preliminary analysis indicated 
that the restorer functions are most preferably located between Gene 14 and Gene 30 within 
the Rfo region as shown in the Appendix. 

[00155] There are several reasons why not all of the transformants recovered following 
transformation with a specific construct necessarily show the same phenotype. The 
expression of the genes encoded in a specific construct may vary depending on the site of 
insertion, the number of copies of the gene at each insertion site, and other factors such as 
transgene silencing. Therefore, it is apparent that other genes in the genetically defined Rfo 
containing region can likely be found in view of the present disclosure to function as genes 
that confer complete fertility restoration. Given that Rfo maps as a single genetic locus in 
radish, the invention provides that two or more different genes in the region are able to 
function as restorer genes for Ogura CMS in B. napus. 

52 



WO 03/006622 PCT/US02/22217 



SEQUENCE LISTING APPENDIX 

FEATURES Location/Qualifiers 
source 1. .271256 

gene join(95. .1292,1600. .1619, 1623 . .1628) 

/gene= " Pept ide_l " 

/note="GenScan; Pl=0.650, P2=0.068, P3=PlyA» 
predicted_peptide_l 1 405_aa 
MCKPNVVTFTTLMNGLCREGRVVEAVALLDR^ 

LNLLRKMEELSHI KPDWI YSAI I DGLWKDGRHTDAQNLFI EMQDKGI F PDI VTYS CMIN 
GFCSSGKWSEAQRLLQEMLVRKIS PDVVTFSGLINALVKEGDLNSAQDLLQEMI SSGVCP 
NVVTCNTLLDGLCDRGKLKDALEMFKAMQKSMMD I DATHAFNGVE PDVQTYNILI SGLIN 
EGKFLEAEELYEEMPHRGIVPDTVTYSSMINGLCKQSRLDEATQMFDSMGSKSFSPNIVT 
FNTLITGYCKAGMVDDGLELFCEMGRRGIVANAITYITLIRGFRICVGNINGSLDIFQEMI 
SSGVYPDTITIRNMLTGLWSKEELKRALAMLEELQMSMVSFMRLL 
(SEQ ID NO:l) 

CDS predict ed_CDS_l 1 1218_bp 

atgtgtaaaccaaatgtcgtaaccttcaccacgctgatgaacggtctttgccgcgagggt 
agagttgtcgaggccgtagctctgcttgatcggatggtagaagatggtctccagcctaac 
cagattacttacggaacaattgtagatgggatgtgtaagatgggagacactgtgtctgca 
ttgaatcttctgaggaagatggaggagttgagccacatcaaaccggatgtggtaatctat 
agtgccatcattgatggcctttggaaagacggacgtcataccgatgctcaaaatcttttc 
attgaaatgcaagacaagggaatctttccagatatagttacctacagctgtatgattaat 
ggattttgtagctctggtaaatggagtgaagcccagcgcttgttgcaagaaatgttagta 
aggaagatcagccctgatgttgtaactttcagtggattgatcaatgcattggtcaaagag 
ggcgatcttaattctgctcaagaccttttacaggagatgatttctagtggtgtgtgccct 
aatgtcgttacttgtaacactttgctggacggtctctgcgatcgcgggaaactaaaagat 
gcattggaaatgtttaaggctatgcagaagagtatgatggacattgatgctactcatgcc 
ttcaatggtgtggaacctgatgttcaaacttacaatatattgatcagcggattgattaat 
gaagggaagtttttagaggccgaggaattatacgaggagatgccccacaggggtatagtc 
ccagatactgttacctatagctcaatgatcaatggattatgcaagcagagtcgcctagat 
gaggctacacaaatgtttgattcgatgggtagcaagagcttctctccaaacatagtgaca 
tttaacacactcattactggctactgtaaggcaggaatggttgatgacgggctggagctt 
ttctgcgagatgggtcgaagagggatagttgctaatgcaattacttacatcactttgatt 
cgtggttttcgtaaagtgggtaatattaatgggtctctagacattttccaggagatgatt 
tcaagtggtgtgtatcctgataccattactatccgcaatatgctgactggtttatggagt 
aaagaggaactaaaaagggcactggcaatgcttgaggaactgcagatgagtatggtatcc 
tttatgcgtctattgtag 
(SEQ ID NO: 2) 

gene complement (join(1810 . .1815,2321. .2722,2845. .2884)) 

/gene= 11 Pept ide_2 " 

/note= n GenScan; Pl=Prom, P2=0.588, P3=PlyA" 
predictedj?eptide_2 1 133__aa 

MISSLTHMGADKAGRNQYEEEEKEEEDGIRAITLSGSNLGATMKTELDDNHGDSYKNGGQ 
ELDFLTTFVNSNFQAVNNS IMMGAKYETHDPGVHLDI SGDVEKPLMKAPGRGWRERKGKT 
QARRDRRESEHTD 
(SEQ ID NO:3) 

CDS predicted_CDS_2 1 402_bp 

atgatctcctccttaacccacatgggagcagacaaggcaggacggaaccaatatgaagaa 
gaggagaaggaggaggaggatggcatcagagccatcacgctctctggatccaaccttgga 
gccaccatgaagaccgagcttgatgataaccatggagacagctacaaaaacggtgggcaa 
gagcttgatttcctgaccacttttgtaaacagcaactttcaagctgtgaacaactccata 
atgatgggcgcaaagtacgagactcatgatccaggagttcatcttgatatctcaggcgat 
gtggagaaacctttgatgaaggcacctgggagggggtggagggaaaggaaagggaaaact 
caagctagaagagatcgtcgagaatctgaacatacagattga 
(SEQ ID NO:4) 
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gene join (3108 . . 3147, 6085 . . 6246, 6324 . .6479,6565. .7151, 

7734. .7929,8893. .9246,10434. .10642,11676. .11921, 
11960. .12011,12023. .12095,14659. .14749,15776. .16004, 
16104. .16109) 
/gene="Petide_3" 

/note="GenScan; Pl=Prom, P2=0.998, P3=0.958, P4=0.987, 
P5=0.521, P6»0.042, P7=0.070, P8=0.410, P9=0.456, 
P10=0.597, Pll»0.652, P12=0.882, P13=PlyA" 
predict edj?eptide_3 1 784_aa 
MTSLFSKARALTSLGSYFKTYRPVTGKLHVAT^^ 

TPTLVTRKPVPELES PKKTGVLVNGTSLLNQNRSGSYS ETSLWLNKTKS YNQPTI I TTKP 
VHVPVPVPEKKSTKKTGISVDCTSFLNQNRSGSCSRTAQPGYN^QTESNRSCPDYFKWI 
HEDIiKPWRETGITREMVERGQTTAHFRLVI VNGKVFVENYKKS IQTRDEFTLWGI LQLLR 
KYPGKLPDVDLl^DCDDRPVIRLIXJYSK^MTAENAPPALFRYGGDRWTADVVFPDWSFW 
GWQEINIKPWSKVLTEMEKGKKKKKFl^REAYAYWKGNPFVASPSREDLLTCNVSSQHDW 
NARI FIQAQEIGRGASEFMQRDLSMENVYDYMFHLLNE YSKLLKFKPQVPQNS VE I CTEA 
MVCPSGDANDTNKRFLMGSLVDEPHNSGPCSLPPPFDPNGLEKFYRKKLNLIRQVEKW^ 
AYWENHRSRSNIGLTFYRTRRTKLERI RKYRFRS ESMLKYLLDLFYGDPQVS VWI YVLQE 
TQSGIRNTQKLLYIRKKYKTTLYMRFKHRSRSNIGLTFYRTRRTKLERIRKYRFRSESML 
KYLLDLFYGDPQVSWIYVLQETQSGIRNTQKLLYIRYFFKFQVKVFRYNFRFRISKXYN 
LGVRVKFEIPFGFSDLILGIjKRFETIYSGLYDCFETLSTAINRKNCVCSQVAERRLTRRR 
IQPPSSSLSPLHEACSPSSRSSSPSILSDNRSSRVQSRRRQRRSRRCQRQRQRRIAKSNS 
TSEL 
(SEQ ID NO: 5) 

CDS predicted_CDS_3 1 2355_bp 

atgacatctctcttctccaaagcccgagctctcacttctctgggctcctactttaaaacg 
taccgtccggtcaccggaaaactccatgtggccaccttgacgcttctcctcttccttctc 
gcagccgctgttgccgtcacctcttctctatggcttagtaagacgacgaaacaatttgat 
acaccgacattagtcacaagaaaaccggtacccgagctagaatcaccgaagaaaaccgga 
gtactggtaaatggcacaagtttattgaatcaaaaccggtctggttcttactcagaaaca 
tctctatggcttaacaagacaaaatcatataatcagccgacaataataacaacaaaaccg 
gttcacgtaccagtaccagtaccagagaagaaatcaacgaagaaaaccggaatctcggta 
gattgcacaagtttcttgaaccaaaaccggtccggttcttgctcgagaacagctcaaccc 
ggctataataataaccaaaccgaatcgaaccggtcatgtcctgattacttcaagtggatc 
cacgaggatctaaagccatggagagagacggggataacgagagaaatggtggagagagga 
caaacgacagcgcatttcaggttagttatagtaaacggcaaagtgttcgtcgaaaactac 
aagaagtctatacagactagagacgagttcacactgtgggggattcttcagctgctgaga 
aagtatccagggaagttgcctgacgtggatctcatgttcgactgtgatgatcggcctgtt 
attagattggacggttacagtaaatctaatcatacagctgaaaatgcaccacctgcgtta 
tttagatacggcggagatagatggacggcggatgtcgtctttccagactggtcattctgg 
ggatggcaagagattaacataaagccatggagcaaagtgttgacagaaatggaaaaagga 
aagaagaagaagaaatttatggagagagaagcttatgcatattggaaagggaaccctttt 
gttgcatctccttcgagagaagatcttcttacttgcaatgtatcctcacaacatgattgg 
aatgctagaattttcattcaggctcaagagattggaaggggagcaagtgagttcatgcaa 
cgagatctatcaatggaaaacgtgtatgattacatgttccatttgttgaatgaatactca 
aagcttcttaagttcaagcctcaagttccccaaaacagtgttgaaatctgcacagaagca 
atggtgtgcccttctggagatgctaatgatactaataagagatttttgatgggctcttta 
gtcgatgagcctcacaattcaggcccatgttcgctacctcctccttttgatcccaacggt 
ctcgagaagttttataggaagaaactgaatctcatccggcaagttgagaaatgggagcac 
gcttactgggaaaaccacagatcaagatctaatataggtctaacattttaccggacccga 
agaaccaaactagaacggatccgaaaatacaggttcagatccgagtctatgctaaaatat 
ttattggatcttttttatggggacccacaagtttcggtttggatctacgtcctacaagag 
acccaatcgggtattcgaaatactcaaaaattattatatattagaaaaaaatacaaaaca 
acactatatatgcgcttcaagcacagatcaagatctaatataggtctaacattttaccgg 
acccgaagaaccaaactagaacggatccgaaaatacaggttcagatccgagtctatgcta 
aaatatttattggatcttttttatggggacccacaagtttcggtttggatctacgtccta 
caagagacccaatcgggtattcgaaatactcaaaaattattatatattagatattttttt 
aagtttcaggttaaggtttttaggtataatttcaggtttcggatttctaaaaaatataat 
ttgggtgttcgggtaaaatttgagatacctttcgggttttcggatctgattttgggtttg 
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aagcggtttgaaacgatttatagcggtttatatgattgttttgaaacattgtcaaccgct 
atcaaccgcaaaaattgcgtttgcagtcaagttgcagagaggaggcttactcgtcgtcgc 
atccagccaccttcgtcgtcattgtctcctctccacgaggcttgctcgccgtcttcccgg 
tcgtcttctccaagtatcttgagtgacaatcgcagcagcagagttcagagtcgaaggcgc 
cagaggcgaagtcgaaggtgccagaggcagagacaaaggcgcattgctaaatcaaactca 
acatccgaactttaa 
(SEQ ID NO: 6) 

gene coinplement (join (16223. .16228,17060. .17321, 18349. .18367, 
20705. .21068,21165. .22997,23102. .23701,23876. .23915) ) 
/gene= " Pept ide_4 ,r 

/note= n GenScan; Pl=Prom, P2=0.681, P3=0.524, P4=0.727, 
P5=0.287, P6=0.668, P7=PlyA" 
predicted_peptide_4 1 1025_aa 

MLTNELDKAYEEEEKYWRQRSRILWLQHGDHNSSYFHAITRSRKAANKFSVIiEKQDGTAV 
FAEEQIAAS I EEYYDTLFTS ASQGKSQ 1 1 EDAI S PRI S PEMNATLTS I PDDLE I KRAVFA 
IHRDKAPGPDGFSASFYQGFWDIIGEEVCREVRDFFISGQLHRRFNEraVRLIPKIKTPK 
TVTDYRP I ALCSTHYKVIAKI LHYLCTS SAKKHCTMAVKTDMSKAYDRLEWS FLRDVMVR 
FGFHPTWVHWIMECVSSVSYSYLINGGAHGCVVPSRGIRQGDPLSPYLFILCSEVLSGLC 
SKAFEQGKLCGIKVARNSPPLNHLLFADDTMFFCKSDPISCKELKNIFEKYEVA^ 
CLKSAITFSAKTPIETRRRVKAELNIVGEGGIGKYLGLPEHFNRKKRDIFASITDRIRQK 
SHSWTSRYLNGAGKLVLLKSILSAMPTYAMTCFKLPKSLCKQIQTVLTRFWWDDKPDH^ 
MSWVAWSTLTLPK^GGLGCREIEKFNDALLAKLAWRILKFPESLLAQTLAGKYCHSTPF 
LSTPAPKSASHGWRGVT AGREVLRQGLGWVT GNGSDINAWSD PWLS PKTPMCPMGPPTEQ 
NKELKVSDLLNGITKEWDLPAIRLHLPQYEEHILKLVPSEFHMKDELCWIiHTRSGEYSTK 
TGYPYWKTNRGEELTDFNWNLCI WQ I RTS PKLKHFLWKIKS KDLPVGANLLHRGI QVEGR 
CKRCGLIETERHVFLQCPFARRVWDLVPVMFKPDPAIITSPEALLQTSRRIVNLPPLG^ 
ETDLYPWI FWYLWI GRNMLI FENREGSEQELARQVSNLDVEAQCFVDAAWNAGTSGGGFG 
CIFKDMSNKTFHRSSSNRSIVGSALIAEALAVKAGLKAARSLGLRKLVIRSDSKSLIMAI 
NTKEKI VEAQGVLFDI DHLCTLFNS VS FEFVSHDS PRLNHNNLNADPSRWI HFTQLADPR 
LYLHHEPPPNSINHHAYRREGQDPPKGQKENEIEKRTLQEKCVLIADQDSVFSI»SAMVDI 
SVSFR 
(SEQ ID NO:7) 

CDS predicted_CDS_4 1 3078__bp 

atgttgacaaacgagcttgacaaagcctacgaggaggaggagaagtattggcgtcaacgt 
agtagaatcctctggctgcaacacggagaccataattccagctactttcatgccattact 
agaagccgcaaggctgccaataaattctctgttcttgagaaacaagatggtactgctgtg 
tttgcagaagaacagatagcagcctctattgaagagtactatgataccctgttcacatcg 
gcgtctcaagggaaatctcagatcatagaggatgctatctcacccagaatctcccctgaa 
atgaatgcaacactcacaagtataccagatgatttggagatcaagcgagcagtgttcgct 
attcatagagacaaagctccgggacccgacggtttttcggccagtttctatcagggattt 
tgggacatcataggagaagaagtctgtcgggaggtcagagatttctttatctcgggtcaa 
cttcatcgtcggtttaacgagactcacgtccggttaataccaaaaatcaaaacgcccaag 
acagtaacggattacaggcccatagcactctgctctacacactataaggtcatcgcgaag 
atccttcattacctctgtacctcatcagcaaagaagcattgcaccatggccgtcaaaacg 
gacatgagtaaggcctatgatcgactggagtggagtttcttgagagatgtcatggtccga 
tttggcttccaccctacatgggtgcattggattatggaatgtgttagctcggtttcgtac 
tcatacctcatcaatgggggcgctcacggttgtgttgtaccgtcgcgtgggatccgacaa 
ggcgacccgctgtctccctatttatttattttgtgctctgaggtcttatcgggactctgc 
tctaaggcgtttgaacaaggaaagctgtgtggcatcaaggtagcgcgcaattcaccacct 
ctaaatcatctcctcttcgccgatgatacgatgttcttttgcaagagtgacccgatatct 
tgcaaagaactaaagaacatctttgagaaatatgaggttgcttctggccaaagcatcaat 
tgtctgaagtctgctataaccttctctgccaaaacacccattgaaacaagacgcagggtc 
aaagcggagcttaacattgtgggtgaggggggtataggtaaataccttggcctcccagag 
cacttcaacagaaagaagagagacatatttgctagcattactgatcggatcaggcaaaag 
tcgcatagctggacctctaggtacctcaatggcgcagggaagttggtcctcttgaagtct 
attctctctgctatgcccacctatgcgatgacgtgcttcaagttgcctaagtcactctgc 
aaacagatacaaactgtgctcactcgcttttggtgggacgataaaccagatcatcgtaaa 
atgtcttgggttgcttggagtactcttactctacctaagagagctggcggtttgggatgt 
cgggagattgagaagtttaatgatgctctgttagcaaagttggcctggcgaatactgaag 
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tttccagagtccttgctggctcaaacactggcagggaagtactgtcactccaccccattc 
ctctcaactccagctcctaaatcggcttctcatggttggcgtggagttatagcgggcaga 
gaggtgttacgccaaggcctaggatgggttattggtaacggatcggacatcaacgcctgg 
tctgatccttggctctccccaaagactccaatgtgtccaatgggtccacctacagagcag 
aacaaagagctgaaagtatcggaccttttgaatggcattacaaaggaatgggatctgcct 
gcgattagactacacttacctcagtatgaggaacacattttgaaactagttcctagcgag 
ttccacatgaaggatgaactctgctggctccatacacgttctggagagtactcaaccaag 
acgggctacccatactggaagacaaatcgtggtgaagagctgactgacttcaactggaac 
ctctgtatatggcaaattagaacttccccgaagctgaaacatttcctatggaagatcaaa 
agtaaagatttgccagttggtgcaaatcttttacacagagggattcaagtggaagggcgt 
tgtaagagatgtggattgatagaaacagagcgtcatgtttttctacaatgcccatttgcc 
cgtcgtgtttgggatctggtgcctgtcatgttcaaaccagacccagcgatcatcacctcc 
ccggaagctcttttgcaaacctcgcgacgcatagttaatctgccaccactgggccttggt 
gaaactgatctctatccttggatcttctggtacttgtggatagggagaaacatgttgatt 
tttgagaaccgagagggatcagagcaagaactggctcgtcaggtttctaacttggatgtg 
gaggctcaatgttttgtagatgcagcgtggaatgctggcaccagtggtggaggatttggg 
tgcatcttcaaagatatgagcaacaagaccttccatcgctcttcatcaaaccgcagcatc 
gtcggctctgctctaattgcagaagctcttgcggttaaagctggcttaaaagcggcacgg 
tctctgggtttgcgtaagctggtcattcgatcagattccaagtcccttatcatggccatt 
aacaccaaagagaagattgttgaagcacaaggagttctttttgatattgatcacctttgt 
actctttttaattctgtttcgtttgagtttgtttctcacgactctccacgactaaaccac 
aacaacctaaatgctgacccatcccggtggatccacttcacacaacttgctgatccacgg 
ctatatctacatcatgaacctccaccaaactccataaatcaccatgcatatcgtcgggaa 
ggccaagatccaccaaaagggcaaaaggaaaatgaaatagaaaagaggacactacaagaa 
aaatgcgttttaatagcagaccaggatagcgttttttcgctttctgctatggttgacata 
tccgttagttttagatag 
(SEQ ID NO: 8) 

gene join(24908. .24947,26266. .26286,27104. .27150,27332. .27454, 

27545. .27609,27695. .27952,28046. . 28251, 28776 .. 28940> 
29079. .29165,29713. .29805,29894. .29911, 30085 . . 30090) 
/gene="Peptide_5« 

/note="GenScan; Pl=Prom, P2=0.000, P3=0.000, P4=0.845, 
P5=0.976, P6=0.993, P7«0.996, P8«0.635, P9»0.883, 
P10*0.490, Pll=0.460, P12=PlyA" 
predictedjpept ide_5 1 3 60_aa 

MPSPLTKKPSVQWPNGKAAPRDWFSSNSSAOTAGLCKSSQLKVADFSYNFPVGNIPKCLE 
HFPRTSFQGNCMQNKDLKHRPSSQCGNAQLVKTHESPSSPPKHQSAEIVAKHHKTSRPKW 
LLALEIVTGSMVALFCLVALFSAVHRWQNRPSLI I PWKKS SSLKEKFAVYVDSEMLKDVS 
RFTRQELEVACEDFSNI IGLSADSQVYKGTMKGGPEI AVI SLCVKEEDWTGYLELYFQRE 
LVDFECmTILARSEKNLRNINSEGAVCVLPNAME^ 

ETPEAMTSLVDPELKHFNQEELETSFDYFKKPPISWLVRLICSHDVECTQSSDRQHATVA 
(SEQ ID NO: 9) 

CDS predicted_CDS_5 1 1083_bp 

atgccttcgcctctaacgaagaaaccttcagtccaatggtttaacgggaaagctgccttc 
agagattggttttcttcaaattcaagtgctaatactgctggtctgtgcaagtcttctcag 
ttgaaagtagctgacttttcatacaacttctttgtgggaaacattccaaagtgtttggag 
cactttccaaggacgagctttcaagggaactgcatgcaaaacaaggatcttaagcacaga 
ccatcttcccaatgcggtaacgcacagctggtcaaaactcatgaaagtcccagctcccca 
ccaaagcaccagtcagctgaaattgtggctaagcatcataaaacatcaagacctaagtgg 
cttcttgcgcttgagatagtcacaggatcaatggtcgctttgttctgcctggttgcactt 
ttctcagcagttcaccgctggcaaaacaggccttctctcatcattccttggaagaaatct 
tcaagcctaaaggaaaagttcgcagtctacgttgattctgaaatgctcaaggatgtctcg 
agattcacaaggcaagagcttgaggtggcgtgtgaagacttcagcaacatcattggttta 
tctgctgatagtcaggtctataaaggaacaatgaaaggtggacctgagattgctgtgatc 
tctctttgcgtcaaagaagaagattggaccggatacctcgagctctatttccagagagag 
ctggttgattttgaatgctggaagacgattcttgcgagatcagagaagaatttgagaaat 
attaacagtgaaggtgcggtatgcgtgctcccaaacgcaatggagagtcgatatctggat 
gtgtctggtaatatatatgcatttggtgttcttttgctggagattgcaaaggagtatctt 
gaaacaccagaggcaatgacgagtttggtggatccggagctgaagcattttaaccaagaa 
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gagcttgaaacgtcttttgactattttaaaaagccgccaatttcatggctggtacgactt 
atctgttctcacgacgttgagtgccatcaaagctcagaccgccagcacgcaacggttgca 
tag 

(SEQ ID NO: 10) 

gene join (30646. .30685,30804. .31118,31322. . 32725 , 32773 . . 32812 , 

32856. .33067,33172. .33294,33383. .33862,33935. .34258, 
34716. .34873,35478. . 35539 , 35640 . . 35758 , 35824 . . 35991, 
36091. .36123,36192. .36197) 
/gene= n Peptide_6 n 

/note="GenScan; Pl=Prom, P2=0.942, P3-0.995, P4=0.3i0 / 
P5=0.588, P6=0.839, P7=0.999, P8=0.136, P9=0.426, 
P10=0.630, Pll=0.972, P12-0.999, P13=0.999, P14=PlyA H 
predictedjpeptide_6 | ll45__aa 

MATKLLSLTC IRKERFSGRYPPVLRKHLNS RPRGGGDGS PSETAVLKIDEEEAVSDAVFR 
VTGMTCSACAGSI EKEIKRLPGIHEAVIDALNNRAQIQFYPTS VDVETI RETIEDAGFEA 
SLIENEANERSRQVCRIRINCMTCTSCSSSIESVLQSLYGVQRAHVAIAIEEAEVHYDPT 
LLSYDKLLEEIDNAGFEAILISTGEDVSKIDLKIDGEFTDEAMTMIERSLEALPGVQSVE 
I SHGSDKI SVL YKPDVTGPRNFIRVI ESTVFGHSGHIKATVFSEGGVGRESQKQEEI KQ Y 
YKSFLWSLVFTVPVFLTAMVFMYI PGI KHLLMFKVINMLTVGEI IRWVLATPVQFI IGWR 
FYVGS YKALRRGSANMDVL I ALGTNAAYFYS L YTVLRAATS PDFKGVDF FETS AML I S F I 
ILGKYLEIMAKGKTSQAIAKLMNLAPDTAILLTVDEEGNVTGEEEIDGRLIQK^ 
PG7VKVASDGYVIWGQSHVNESMI TGEARPVAKRKGDTVIGGTLNENGVLHI KVTRVGSES 
ALAQ I VRLVESAQI1AKAPVQKE1ADRI SKFFVPLAQLLLKLTNYNI AAGKLHWYPESWI PS 
SMDS FELALQFGI SVMVI ACPCALGLATPTAVMVGTGVGASQGVLI KGGQALERAHKVNC 
IVFDKTGTLTMGKPVVVKTKLLKNMVLREFYELVM 

ENPTWPEARDFVS I TGTGVRATVKGRE I MVGNKS LMS GYKVT I TADAE ELL AEAE EMAQT 
GILVS IDNELTGVLAVSDPVKPSAREAI S I LKSMNI KS I MVTGDNWGTANS I AREVGI DS 
VI AEAKPEQKAEKI KELQAAGHWAMVGDGVNDS PALVAADVGMAIGAGTDI AI EAADI V 
LMKSNLEDVITAIDLSRKTFSRIRLNYWALGYin^MGIPIAAGVLFPSTRFRLPPWIAGA 
AMAASSMLRNGPRFVSLLVSFGCILFEDDALVLRIRRNKAEI I SNRNGPSLKATRINSAG 
MTNSNYTELNELYNKYKDKGLEILAFPCNQFGEEEPGTTDQITEFVCTRFKSEFPIFNKI 
EVNGENAS PLYKFLKKGKWGI FGDEIQWNFAKFLVDKNGQAVERYYPTTS PLTLEHDI KK 
IiLNLS 
(SEQ ID NO: 11) 

CDS predicted_CDS_6 ) 343 8 _bp 

atggccacgaagctattgtcgctcacatgcatacgcaaagagagattcagcggccgttac 
cctcctgtgctgcggaaacatctaaacagcaggccacgcggcggcggagatggatcaccg 
tcggagacggcggtgcttaagatcgacgaggaggaggcggtctccgatgcggttttccgc 
gtcacggggatgacatgctccgcgtgcgctggatctatcgagaaagagatcaagcgtctt 
cccgggattcacgaagcggtcatcgacgctctcaacaaccgggctcaaatcpagttctac 
cctacctctgtcgatgtggagacgattcgtgagactattgaagatgctggatttgaagca 
tcactgatagaaaacgaggcgaatgagaggtccagacaagtatgcaggataagaataaac 
ggtatgacttgtacctcatgttcttcatccattgaaagcgtactgcaatcactttacggt 
gtacaaagagctcatgttgccttagcaattgaagaagctgaagttcattatgatcccacg 
ctcctgagctacgataaactactggaagagatagataatgctggatttgaagcgattctt 
ataagcacaggcgaagatgtgagcaagattgatttgaagatcgatggcgagtttactgat 
gaagcaatgacgatgattgaaagatcgcttgaagcacttcctggggttcagagtgttgag 
atcagccatggaagtgataagatatctgtgttgtacaaacctgatgtgacggggccgagg 
aacttcattcgtgtgatagagtctactgtctttggtcatagtggtcacatcaaggcaaca 
gtattctccgagggaggggtgggcagagagtctcaaaagcaagaagagatcaagcagtac 
tataagtcgtttctctggagtttggtttttacggtaccagtgtttttgacagccatggtc 
tttatgtatatccctggaattaaacatttgctaatgtttaaggtcatcaatatgctcacc 
gttggagaaatcataaggtgggttttggctacacctgtacagtttatcatcggctggaga 
ttctatgttggctcttacaaggctttacgccgaggatcagctaacatggatgttctgatt 
gctctgggaacaaatgcagcttatttctattcgttatacacagtgttgagagctgcaaca 
tctcctgatttcaagggagtagatttctttgagactagtgccatgctcatttcgtttatc 
atactaggaaagtatctggagataatggcaaaaggcaaaacatctcaagcgatcgcaaag 
cttatgaacttggcaccagacactgcgatattgttgactgtggacgaggaagggaatgtg 
actggtgaagaagagattgatggccgattgatacagaagaacgacgtgatcaagatcgtt 
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cctggtgctaaagtagcttccgatggttatgttatatggggacaaagtcatgtgaatgaa 
agtatgataactggagaggcaaggccagtggcaaagagaaagggtgatactgttatagga 
ggcacactgaacgagaacggtgttctgcatattaaggtgacaagggttggttcagagagt 
gctcttgcacagattgttcgacttgttgaatccgcccagctagccaaagctccagttcag 
aagttggctgatcggatatccaagttctttgttcctctggcacaactacttcttaaactt 
acgaattataatattgcagctggaaaactgcattggtaccctgaatcatggataccttct 
tcaatggatagctttgagctagctcttcagtttggaatctctgtcatggtcatagcttgt 
ccatgtgctcttgggctggctactccaaccgctgttatggttggaactggggttggtgca 
tcccaaggtgtgctgataaagggtggtcaagctctagaaagagcacacaaggtaaattgc 
attgtatttgacaagacaggaactctcacgatggggaaacccgttgttgttaaaaccaaa 
ctcctgaaaaacatggtacttcgagaattctatgaacttgtggctgcaactgaggtaaac 
agcgagca t cca 1 1 ggcaaaggcca 1 1 gt t gaat a t gcaaagaaat t cagaga t gacgaa 
gagaaccctacgtggcctgaagcccgtgattttgtgtctatcactggaaccggagtgaga 
gcgactgttaaaggaagagagattatggtgggaaacaagagcctcatgtctggttacaaa 
gttactattacagccgatgctgaggagttgctagcagaagctgaagagatggcccagaca 
ggaattctcgtgtctatagacaatgagttaactggagttctagctgtttcggatcctgta 
aaaccgagtgctcgagaagccatctcaattctaaaatccatgaatatcaaaagcatcatg 
gtaacaggtgacaactggggaactgcaaactccattgctagagaagtcggtatcgactct 
gttatcgcagaagctaaacccgagcagaaagcagagaaaatcaaggaactacaggctgcg 
ggtcatgttgtggcgatggttggagacggagtcaatgactcacctgctctcgtggcagcg 
gatgtaggaatggccattggtgcaggaaccgacattgcaatagaagcagctgatattgtc 
ctgatgaaaagcaacttggaagatgtgatcacagccattgatctatcgaggaaaacgttc 
tcaaggatccgtctcaactacgtatgggctctcgggtataacctcatgggcataccgatc 
gctgcgggggtgcttttcccgtctactcggttcaggttgcctccgtggattgcaggtgct 
gcaatggctgcttcttctatgctgagaaatggtccgagatttgtttctttactcgtgtcc 
tttggatgcattttatttgaagatgatgctttagttttacgcatcaggcgcaataaagct 
gaaataataagcaaccgcaacggcccatcattaaaagccactcgcattaacagcgctggg 
atgacgaactctaactacactgaattgaatgagctttacaacaagtacaaagacaaaggt 
ctggagattctagcatttccttgtaaccaatttggtgaagaggaacctggaactactgac 
caaattacagagtttgtctgtacccgcttcaaatctgagttccccattttcaacaagatt 
gaagtgaacggagagaatgcttcccctctttataagttcttgaaaaaaggcaaatgggga 
atcttcggggacgagattcagtggaactttgccaagtttcttgttgacaagaatggtcaa 
gctgttgaacgttactacccaactacttctcctcttacactagagcatgacatcaagaag 
cttctgaacctctcatga 
(SEQ ID NO: 12) 

complement ( join(36226. .36231,36662. .36706, 36908 37048 , 
37163. .37321,37422. .37553,37658. .37725,38166. .38634, 
39144. .39183) ) 
/gene* n Pept i de_7 " 

/note="GenScan; Pl«=Prom, P2=0.859, P3<=0.993, P4=0.976, 
P5«0.861, P6=0.821, P7=0.640, P8=PlyA" 
predicted_j)eptide_7 1 337_aa 

MSFPHYYLQRGAFl^SPSQVASGLHAPPPPGMRPMPNPNVHHPQAN^ 
SDFGHNIHMGMASSASVQPLQPPQI^QPLVKKKRGRPRKYAPPDGQVSLGLSPVPCSSGG 
S SKAAKDS SAVTDPN^KRARGRPPGTGRKQRLAN SVGAGED 
I VSKVMS FSQQRPRALC IMSGTGTVS SVTLRQPATTES SLTYEGRFEILSLGGS YLVNEE 
GGSKSRTGGLS VSLSGPEGHVI GGGIGMLIAASLVQVVACSFI YGGVNNSNKTI KQENKP 
KEEHKDSEMETNPQEEEEEEEAQLLKQQQQQQHTCRM 
(SEQ ID NO: 13) 

predicted_CDSj7 | 1014_bp 

atgtcatttcctcactactacctccaaagaggagccttcaccaatctctccccttcccag 
gtcgcgagtggtctccacgcgccgccgccgccgggaatgagacctatgccgaaccctaac 
gttcatcatcctcaagctaacaacccgggtcctcatttctccatggccgaccacagacac 
tctgatttcggacacaacattcacatggggatggcttcctctgcttctgtgcagccgctg 
cagccgccacagatggagcagccgttggttaagaagaagcgtggacggccgaggaagtac 
gctcctcctgatggacaagtctctttagggctttctcctgtgccttgttcttctggtggt 
agtagtaaagcagctaaggactcttctgcagtgactgatccaaatgctcctaaacgagcc 
agaggtcgacctcctggtactggaaggaagcaacgtttagctaatcttggtgagtggatg 
aacacttcagctggacttgcttttgcacctcatgtcatcagtgttggagcaggagaagat 
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atcgtttcaaaagtcatgtcattctcacagcaaagacctcgagctctttgtataatgtca 
ggcaccggaacagtttcttcagtcactctacgtcaacccgctacaacagagtcttctttg 
acatatgagggacgttttgagatactaagtctagggggatcttatttggtgaatgaagaa 
ggtggatccaaaagccgaacaggcggtttgagtgtctccctttctggtcctgaaggtcat 
gttattggcggtgggattggaatgcttattgcagccagcctcgttcaggttgtggcttgt 
agttttatatacggaggagttaataacagtaacaagactatcaaacaagaaaataaacca 
aaggaagagcacaaggatagcgaaatggagactaatccacaagaagaagaagaagaagaa 
gaagcacagctattgaagcagcagcagcagcagcagcatacgtgcagaatgtga 
(SEQ ID NO: 14) 

gene join(40484. .40523,40553. .40692,41204. .41375,43081 . .43086, 

43211. .43216) 
/gene=»Peptide_8 ! ' 

/note="GenScan; Pl=Prom, P2=0.955, P3o0.388, P4=0.167, 
P5«PlyA" 

predicted_peptide_8 1 105_aa 

MGIRGPNRVSVLSIQISVFRVYQNQPYSDYLKVRFGTGSGFIGFGSGVIiNVSGSIGYPFS 
^FGFGTDNTHNPKYHETRSIRYLCRVRIGSDSFLSDRVRFEFSDSN 
(SEQ ID NO: 15) 

CDS predict ed_CDS_8 1 3 1 8_bp 

atgggtattcggggtcctaatcgggtttcggttttatccattcagatttcggtttttcgg 
gtttatcaaaaccagccctattcggattatttgaaagttcggttcgggaccggttcgggt 
tttatcgggttcgggtcgggggttttgaatgtttcgggttctatcggatatccatttagt 
ttcgggttcggtacggataatacccataacccgaaataccatgaaacaagatccattcgg 
tatttatgtcgggttcggatcggttcggattcatttttatcggatcgggttcggttcgaa 
ttttcggattcgaattaa 
(SEQ ID NO: 16) 

gene complement (join (43579 . .43584,44345. .44516,44776. .44966, 

45056. .45151,45243. .45341, 45418 . .45510, 45622 . .45957, 
46034. .46247,46267. .46398,46486. .46544,46767. .46843, 
46952. .47162,47257. .47373,47455. .47541,47903. .47998, 
48102. .48167,48264. .48446,48537. .48686,48982. .49164, 
49324. .49410,49529. .49690,49956. .50092,50180. .50237, 
50366. .50488,50659. .50716,50874. .52171,52360. .52399) ) 
/gene= n Pept ide_9 11 

/note=«GenScan; Pl=Prom, P2=0.869, P3=0.998, P4=0.867, 
P5=0.889, P6=0.935, P7=0.741, P8=0.896, P9»0.998, 
P10=0.617, Pll»0.999, P12«0.999, P13=0.994, P14«0.758, 
P15=0.999, P16=0.993, P17=0.879, P18=0.954, P19:=0.893, 
P20=0.891, P21=0.906, P22=0.667, P23=0.951, P24«0.999, 
P25=0.984, P26«0.958, P27=PlyA" 
predict ed_peptide_9 1 1494_aa 

MGKGRAKAVEKRVLDQKLRGSVNVPSGPVYYPTEDEFKSPLDYIHKIKPEAEAYGICKIV 
PPKSWKPPFGLDLESVRFPTKTQEIHRLQFRPASC^SK?rFQLEYG^ 
VFEGGDLDLCKVFNAVKRFGGYDKWKGKKWGEVYQFMS SGEKI SKCAKHVL 
HEFEKYHGMMSADPSAKGHKRNRRCSEFSSSKRRKRNNGEKNHKVESEEEEVDQACEQCK 
SDKHGEVMLLCDSCNKGWHI YCLS PPLKHI PPGNWYCLDCLNTDEDTFGFVPGKCLLLED 
FKRIADRAKRKWFGSGPVSRTQIEKKFWEIVEGSGGDVEVMYGNDLDTSVYGSGFPRIGD 
RRPES VEANVWDEYCHS PWNLNNMPKLKGSMLQAI RHNINGVTVPWLYLGMLFS AFCWHF 
EDHCFYSVNYHHWGEAKCWYGVPGSAASAFEKVMRKALPDLFDAQPDLLFQLVTML 
LQENKVPVYTVLQE PGNFVI TFPKSFHAGFNFGLNCAEAVNFATADWLPYGGYGAELYRL 
YRKPAVISHEELLCWAKGNSCDGKGSIY^ 

LPECPDSVGIEEDTTCIICQQFLHLSAIVCSCRPSWACLEHWKHLCECEPTKLRLVYRY 

TIjAELHGMVQEVEKFGDCKTQETKNSQRPSSGTKRSGASNKKEC^IQVCQARPAEDWLRRA 

S KVLQDVFSSDVYATLLKEAEQFLWGGSEMNRVRDVAKSL I KAKI WAEAVGDCLSKVEGK 

GNNDTEKVHLEFIDELLKVDPVPCFQSGYLKLKDFAEEARKLSEKIDCSLSSSPTIAQME 

LLHSEVSSSPVSLSKHKILSKKISSAKMLAKRARCYLAASKPPGIELDELFKXKSEIL^ 

QVTLTETEGILDLLKKSELARDKCSK^SGFISLKNVEDLVHEFDGLCSINIRELNILRQ 

YHVDALSWISRFDDTMADVREGKDQRKLISDLSSLLQDGASLGIQVSEIVEGLPLVEVEL 
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KKAS CREKAQTVYAARS SLDFI EQLLSEAVVLQI EEEKLFVEI AGTLSTARFWEERAS S I 
LASETQMSDLKELVHPCLVFLKSFVHQKXIDVRRMSVNIGAVLPSLKGIENT 
NSEPFLSAVSSTASSPCSLLELPVLKDLVAQSKSLGVQLEEPRILETLLLSCERWQCDNH 
PLLQETEDIJ^DTAKTDIKSKHSTILPK:^ 

LGWCYKTILLGSSSPSPEIPEDLGKPSLQKIQQHLEAVCSDNNINFIiRLGQTLKILPEEY 
HLGKRLVELKDTGQEWAKRARKVVTDSGALALEDVFELI S EGENLPVI AEEELQALRARS 
MLHOTCLKPYNSRFMVSCSQCGEWYHTYCVKLHWRPEAWCFACCPPAESSPKNDPSRSM 

E PKT PSLDHRRARRWTGAAVGDLQWKSRKR I KRVAKRCLQVH I L PWF FTRE PK 

(SEQ ID NO: 17) 

CDS predicted_CDS_9 1 4485_bp 

atgggaaaaggtagagctaaggcagtagagaaaagggttttagatcagaagctgagagga 
tcggtcaatgtcccgtctggaccggtttattacccgaccgaagacgaattcaaaagccct 
ctagattacatacacaagatcaagcccgaggcagaggcttacgggatctgcaagattgtt 
ccaccgaaaagctggaaaccgcctttcggtttggacttggagtctgtcaggtttccgaca 
aagacgcaggagattcaccggttgcagttccgtcccgcttcttgcaactccaagacgttt 
cagctggagtacgggaggtttgtggaggagcgtttggggaagaaggtgaagaagagggtg 
gtttttgaagggggtgatttggatctgtgtaaggtgtttaacgcggtgaagaggtttggg 
ggttatgataaggttgtcaaggggaagaaatggggtgaggtttatcagtttatgagctct 
ggtgagaagatctccaagtgtgctaagcatgttttgtgtcagttgtataaagagcatttg 
catgagtttgagaagtatcacgggatgatgagtgcggatccttctgcaaagggtcacaaa 
aggaatcgacggtgttctgagttctctagctcgaagagaaggaaaagaaataatggcgag 
aaaaatcataaggtggaaagtgaagaagaagaggttgatcaggcatgtgagcagtgcaaa 
agtgacaaacatggtgaagtgatgctcttgtgtgatagttgtaataaaggttggcatata 
tactgtctctcgccgccgttgaagcatatcccgcctgggaactggtattgccttgattgc 
ttaaacactgatgaagacacttttggctttgtgccgggtaaatgtttgctacttgaagat 
ttcaagcggattgctgatcgcgccaaaaggaagtggtttggttcagggccggtgtctagg 
acgcagattgagaagaagttttgggaaatagtagaagggtcaggtggtgacgttgaagtt 
atgtatggtaatgacttggatacttctgtttacgggagtggttttcctagaataggtgat 
agaagaccagaatcagttgaggcaaatgtttgggatgaatattgccatagcccttggaat 
ctcaataacatgcctaagttgaaaggatctatgcttcaggccattcggcataacatcaat 
ggtgttacagtgccttggctatatcttggaatgctcttctctgctttttgttggcatttt 
gaggaccattgtttttactctgtgaattatcaccactggggagaagcaaaatgttggtat 
ggtgttccaggcagtgctgctagtgcttttgaaaaggtcatgcgaaaagccctacctgat 
ctctttgatgctcagccagatttgctctttcaactggttactatgttgagtccggctgtt 
ttgcaagaaaataaagtccctgtgtacacagtattacaggagcctggaaactttgtgatc 
acgtttccaaaatcctttcatgctggattcaatttcggtttgaattgtgcagaggccgtc 
aattttgccactgccgattggctaccttatggtggttatggtgcggagctgtataggctg 
taccgtaaacctgcagtcatatctcatgaagagcttctctgtgtggtagctaagggaaac 
agctgcgatggcaaaggatcaatatatttgaagaaagaactgctcagaatatatagcaag 
gaaaaaactttgagagagcagctttggaaaagtggtattttgagatcctctcctatgttt 
ctacctgaatgccctgattctgtgggcatcgaagaggatacaacatgcatcatctgccag 
cagtttctccatctttctgctatcgtctgcagctgcaggccatctgtttttgcatgcttg 
gagcactggaagcacctttgtgaatgtgaacctacaaaactgcgcctggtatatcggtat 
acccttgccgagttgcatggaatggtacaagaagttgaaaagtttggtgactgcaaaact 
caagaaaccaaaaattcacaacggccgagttcaggcaccaaacggtcaggtgcttctaac 
aaaaaggaggggatgcaagtttgccaggcacgaccagctgaagattggcttcgtcgagca 
tcaaaggttctccaggatgtcttttccagtgatgtatatgccactcttttaaaggaagca 
gaacagtttctttggggtggatcagaaatgaaccgcgtacgggatgttgcaaaaagtttg 
atcaaagcaaagatatgggctgaagctgttggcgactgtctttcaaaagtcgaaggcaaa 
ggcaacaatgatacagagaaagttcacttggagttcatcgatgagttgctgaaagttgac 
cccgttccttgctttcagtctggttatcttaaattaaaggactttgctgaagaggctagg 
aagttgtctgagaaaatcgattgttctctgtcaagtagcccaacgatcgcccagatggag 
ctattgcattccgaagtttccagttcaccagtctccctaagtaaacacaagatcttgtca 
aagaaaatatcttccgcgaagatgttagctaaaagggcgagatgctatcttgccgcttca 
aaacctcctggaattgagttggatgaacttttcaagctaaagtcagagatattggagctt 
caggtgacgcttacagaaacagaagggatcctggatttgctaaagaaatcagaactagcc 
cgtgataaatgtagcaaagttttgagtggttttatatctctcaagaatgtagaagatttg 
gttcatgaattcgatggcctttgtagcattaacattcgtgagctgaatatcctgaggcag 
taccatgttgatgctttgtcttggatttcacgctttgatgatacaatggctgatgttcgt 
gaaggcaaggaccaacgaaagctaatcagtgacctgagttcccttctccaggatggagca 
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tccttaggcattcaagtatctgaaatagttgaagggctccctcttgttgaggttgaattg 
aagaaggcatcttgtcgtgaaaaagcacaaacggtttatgctgcaagatcgtctctggat 
ttcattgagcagctgctctcggaagctgttgtactacaaatcgaagaggagaagctgttt 
gttgagatcgccggaactttgtctacagcgaggttttgggaggaaagagcaagcagtatt 
cttgccagtgaaactcagatgtccgaccttaaagaactcgtacacccatgcttagtattt 
ctcaagtcttttgttcatcaaaaagatgatgttcgtaggatgtctgtcaacattggtgcc 
gttttaccctctctgaaaggcatagagaacaccatttcgttggctgaaacttggcttcag 
aattctgagccctttttatctgccgtttcctctactgcctcatctccgtgttctctgctt 
gaacttcctgtgttaaaggacctggttgctcagtctaaatcgcttggtgttcaacttgaa 
gagccaaggattcttgaaacattactgcttagttgcgagaggtggcagtgtgataatcac 
ccgctcttgcaagaaactgaagatttgttggacactgcgaaaacagatgatggcaagcat 
agcacgattcttccgaagattatggacttgataaccagagtggactgcgccagaacatct 
ggtctgtcccttggtcttaatttggaagaacttcccaaacttcaaacggcaagtttaaaa 
ctaggatggtgttataagaccatcttgttaggctctagttcaccttcccctgagatacca 
gaagatctaggaaagccctcgttacagaaaattcagcagcacttagaagcggtatgttct 
gataataatatcaacttcctgcgcttaggacaaacactgaaaatattacctgaagagtat 
cacttaggcaagagacttgtggagctaaaagacactggacaagagtgggcaaaacgagct 
agaaaagtggtgacagactcgggtgctcttgccttggaagatgtattcgagcttatctct 
gagggtgaaaatttgcccgtcattgcagaggaagaacttcaggcattacgagctcgaagt 
atgttgcactgcgtttgcctgaagccatacaactcaagattcatggtttcttgtagtcaa 
tgtggcgaatggtatcacacctattgtgttaaacttcactggcggcctgaggcttatgtc 
tgcttcgcttgctgtcctccggcagaatcctctccaaaaaacgatccctccagatcaatg 
gagccaaagacaccgtcacttgaccatagacgggcaaggagggtagtgaccggtgcagca 
gttggtgatttgcagtggaaaagccgtaaacgcatcaaaagggtagctaaacggtgtctt 
caggttcatatccttccctggtttttcactcgagaaccaaaataa 
{SEQ ID NO: 18) 

gene join(52990. .53029,53942. .53981,55943 . .55978, 56068 . .56294, 

56387. .56522,56608. .56741,56834. .56885,57038. .57123, 
57238. .57426,57519. .57671,57849. .57950,58036. .58278, 
58289. .58294) 
/gene= " Pept ide_l 0 " 

/note= n GenScan; Pl=Prom, P2=0.308, P3=0.600, P4=0.976, 
P5=0.934, P6=0.988, P7=0.510, P8=0.940, P9=0.965, 
P10=0.876, Pll=0.964, P12=0.978, P13=PlyA" 
predicted_peptide_10 1 465_aa 

MFQYVSCIVQCGYEPQESNRGLTSSGGDDRIGGEGNDVPQFREFSIBTLRNATSGFSTEN 

IVSEHGEKAPNVVYKGKLDNQRRIAVKRFNRKAWPDSRQFLEEAKAVGQLRNYRM 

CCYEGEERLLVAEFMPNETLAKHLFHWESQPMK^ 

DL^YRVLFDDDSNPRLSCFGLMKNSRDGRVTPESVMYSYGTLLLDLLSGKHIPPSHALD 
LIRDRNIQMLIDSCLEGQFSSDDGTELVRIASRCLQYEPRERPNPKSLVTAMIPLQKDLE 
TPSHQLMGI PSSASTTPLS PliGEACLRTDLTAI HEI VEKLGYKDDEGAATEMWTNQMQDS 
LNFKKKGD VAFRHKDFANAAECYSQF I EGGTMVS PTVYARRSLCHLMNEMPQEALNDAMQ 
AQVI S PAWHI AS YLQAVALTALGQENEAHAALKDGSMLESKRNAL 
(SEQ ID N0:19) 

CDS predicted_CDS_10 1 1398Jbp 

atgtttcaatatgtttcttgcattgttcaatgtggctatgagcctcaagaatccaatcgt 
ggactcaccagctctggtggtgatgataggataggtggtgaagggaacgatgtgcctcag 
tttcgtgaattctctatagagacgctaaggaacgctacgtcagggttttctacagagaat 
atagtatcagagcatggtgagaaagctcccaatgttgtctacaaagggaagttggataat 
cagagacgtatcgctgtcaagaggtttaacaggaaagcttggcctgattctcgtcagttc 
ct ggaggaagctaaagctgttggtcagttaaggaactataggatggctaatctgcttgga 
tgttgttatgaaggtgaagagagacttcttgttgctgagtttatgcctaatgaaactttg 
gctaagcatcttttccactgggagtcacaaccgatgaagtgggcaatgagactaagagta 
gctttacatattgctcaagctttggagtactgtacaggcaaagggcgtgcactctaccat 
gaccttaatgcttatagagttctctttgatgatgactcgaatccaaggctttcttgcttt 
ggtctgatgaaaaatagtagggatggtcgcgtgacaccagaaagtgtgatgtacagttat 
ggaactctgttgcttgatcttctcagtggaaaacacattcctccaagccatgcgctggac 
ctcataagggacaggaacattcaaatgttgatagactcatgtttggagggtcaattttca 
agtgatgacgggactgaactggtacggttagcttctagatgcttacagtatgagcctcga 
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gaacggcctaacccaaaatctctagtcactgcaatgatccctcttcagaaggatcttgag 
actccttcacatcaactaatgggcataccaagcagtgcctcaacaacgcctctttcacca 
ctcggagaagcatgcctaagaactgacctaactgccatacatgagattgttgaaaaactt 
ggttataaagatgatgagggagcagccacagagatgtggaccaaccagatgcaggactcg 
ctgaacttcaagaaaaagggtgatgttgctttcaggcataaagactttgcaaatgctgct 
gaatgttattctcagtttatagagggtgggacaatggtttcaccaactgtttatgcaagg 
agaagtctgtgtcacctgatgaatgagatgcctcaagaggcgttgaatgatgcaatgcaa 
gcccaagtgatatctcccgcttggcatatcgcatcttatcttcaagctgtagctctcaca 
gctctaggacaagagaacgaagcacacgctgctcttaaagacggatcaatgctcgaaagc 
aaaagaaacgctctatga 
(SEQ ID NO: 20) 

gene join (58757. .58796,59699. .60947,61340. .61469,67106. .67244, 

67265. .67376,69364. .69578,69988. .69993) 
/gene = n Pep t ide_l 1 n 

/note=»"GenScan; Pl=Prom, P2=0.056, P3=0.008, P4=0.354, 
P5=0.298, P6«0.137, P7=PlyA" 
predicted_peptide_ll ) 614_aa 

MASQCSENVI PTLRDFFNSQKPGEEDEFMVHQVLNSSSKNI PPRPTKLGNKAKERHGLFN 
QGRII^RRNISDAEKFSVEQYSSSGFFGVRPNTNGRQQQQQRSAKPLGSDRNMEPRIiQKS 
FSARMQLPFMLSSKPSNQSTNNSSWFSRIKKMSNPFSNRNSLIPKSGEIKVSGVGETLSR 
NKSSSPVHLHAHLSIQHELGMPVFTFSLDHPDDVYTARTWMDVNDSRFVYSFR^ 
KNLGEQRSNVSGIDSSIiIGQMQVSTQVSLEVEEEPYEDPVESAVSEFVLFDIARARRSGL 
KTEQLSRQNS VSDGLKHLQRQNSFSRGLTRSFS KHSENSAS S SSDPWPATDLHPGLE I AA 
WIKDSYSSSNNESFEYMKNSKLSRREMKVIVPSGNHGLPDAENSCPTPILQRWRSGSQL 
YKHSQYVWLSCTEQKYQEEKEATLCLGAVRLESLLTWQLRQPDECSSFSKPIQHNFAKLM 
RFIWNVPGGDGDNNLNGTCWLPSLPYMTQWQNAKAPSYSISSTKTPPPLQLLLPPPLNA 
IRHDQLAQPLLISPLIVINAFNGQECKITDRFKGFKEEETVKFYIYWRCLNRFLIRVATC 
CATHTVPATVQQNP 
(SEQ ID NO:21) 

CDS predicted_CDS_ll 1 1845Jbp 

atggcatcgcagtgttctgaaaacgttattcctactctacgtgatttcttcaactcccaa 
aagccaggggaagaggatgagtttatggttcatcaggtcctcaactcctcttccaagaac 
attccaccgagacctacaaaactaggaaacaaggcaaaagagagacacggtctcttcaat 
cagggaagaataagaagcagaagaaacatatcagatgctgaaaagttctctgtggagcaa 
tactcttctagtggtttctttggggtacggttcaacacaaacggaagacaacaacagcaa 
caacgatcagctaagcctttagggtcagatagaaacatggaaccgagattgcagaagtca 
ttctccgctagaatgcaactcccctttatgctatcttcaaagccaagcaaccagtctacg 
aataattcaagctggtttagccgtatcaagaaaatgtctaatccattttcaaatcgaaac 
tctctgataccaaagtcaggagaaatcaaggtcagtggagtaggagaaacactctcaaga 
aacaagtcttcttcacctgttcatctacatgcccatctcagtatccaacatgaacttggg 
atgcctgttttcaccttctctctagaccacccggatgatgtgtatacggccaggacgtgg 
atggatgttaacgactctcggttcgtctattcgtttcgttacattggtggtagaagcaac 
aagaacctcggagaacagaggtcgaacgtttcaggtatagactcttcacttataggacag 
atgcaagtttcaactcaagtctccttagaggtagaagaagaaccatacgaagatcctgtt 
gaatcagccgtgtcagagttcgttctttttgacatcgcacgagcacggagaagtggactc 
aagactgaacaactgtcaagacagaacagtgtatctgatggcttaaagcatctgcaaagg 
cagaacagtttcagcagagggttgactcgtagtttctcaaaacattcagagaacagtgca 
tcatcatcatctgatccttggccagccacggatttacatccaggtctagagattgcagca 
gtcgttattaaagactcttattcttctagcaataatgagagttttgagtacatgaagaac 
agtaaactctctagacgagagatgaaggttatagttccatcaggaaaccacggtttgcct 
gatgctgaaaactcatgtcctacaccgatactgcagagatggagatcaggctctcagctt 
tacaagcattctcagtatgtgtggctgagctgcacagaacagaagtatcaagaggagaaa 
gaagcgactctttgtctaggtgcagttcgcttagagagcttattgacatggcaactccgc 
caacctgatgagtgtagttcgttctccaagccgatccaacacaacttcgccaagctcatg 
cgtttcatctggaacgtaccaggaggagatggagacaacaatctcaatggtacttgttgg 
ttgccctcgttgccttatatgacccaaaatgtccaaaatgcaaaagcaccgtcctactcg 
atttcctcaaccaagacgcctcccccgctacaactgctcctgccgccaccactgaacgca 
ataagacatgaccaacttgcacagcctctcctaatcagcccattaattgttataaacgct 
tttaatggtcaggaatgtaaaatcacagaccggtttaaagggtttaaggaagaagagacc 
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gtgaaattttacatttactggagatgcttaaatcggttcttgatcagagtggctacgtgc 
tgcgcaacccacacggtgccagccacggtgcagcagaatccatga 
(SEQ ID NO: 22) 

gene join(73822. .73861,75318 . . 75606, 75677 . . 75984 , 76063 . .76139, 

76260. .76485,76586. . 76721 , 76935 .. 77101, 77273 .. 77408 , 
78119 . . 78240, 79010 . . 79015) 
/gene= n Pept ide_12 n 

/note= n GenScan; Pl=Prom, P2«0.951, P3=0.989, P4=0.507, 
P5=0.492, P6=0.997, P7=0.864, P8=0.418, P9=0.265, 
P10=PlyA" 

predictedjpeptide_l2 1 486_aa 
MEIEELELTPLRQPSPEMKQSWVLKPTLSKKKKKLSV^ 

VITLFKTLPHHQS I PPPQDNSTIALPMALKFFNAQISGKLPEGNNVSWRGNSCLNDGNFP 
GSLYPHLAGGYYDAGGS I KTS FTMSFSMTMLSWSVI EYGSKYEACGEVDHVKGLI KWGTD 
YFLRTFS SS SDTI YEMVYQIGMNQGSQVTSDLYCWMRPEDIDYQSAKAVYLYAEAMSAKR 
KSADHWDDLIWGGAWLYYATGDNSYLAKVTSHDLAN^ 

LLFTRLRLFLS PPF PYEEMLRVFHEQTS I VMCS YLPYYTKFNRTKGGL I LLSEPE PLQYA 
ANAAFLATLYSDYQGASDAPGWYCGPTFFKTEILRDFSTSQGSLDPEEQESDMRRRFEVE 
GEHERESKEIDRRGNGRWTRQEGWLPRRLLKESLIjNNI VSTQI CS ETEGDKF 1 1 SSQYCNT 
QWGKT 

(SEQ ID NO: 23) 

CDS predi ct ed_CDS JL2 1 14 6 l_bp 

atggagatcgaggagctcgagcttacacctctgagacagccttctccggaaatgaagcag 
agttgggttctcaaaccaacactctccaagaaaaagaagaaactttctgttgcttgcatc 
tcaagcaacaaaaggttacttgtatggcttggaggagtcttggctgtgtcgtctatcgtc 
gtgataacactgttcaagacactcccccaccaccagagcattccaccaccgcaagacaat 
agcaccatcgcacttcctatggcattgaaatttttcaatgcccaaatctccggaaaactg 
ccagaggggaataacgtgtcttggaggggaaattcttgcttgaacgatgggaattttccc 
gggagtctttacccacatctggcgggagggtactacgatgctggaggttcgatcaagaca 
agcttcaccatgtctttctcaatgacaatgttgagctggagtgtcattgaatatgggtcg 
aaatatgaggcttgtggagaagtggaccacgtcaaagggctcattaaatggggaaccgac 
tacttcctccgcacttttagcagcagttctgatacgatctatgaaatggtgtatcagata 
gggatgaatcaaggaagccaagtgactagtgacctatactgctggatgcgaccagaagac 
attgattaccaaagtgccaaagccgtttacctttacgccgaggctatgagcgccaagcgg 
aaaagcgcagatcactgggatgacctcatatgggggggagcgtggctctactatgccacg 
ggtgataactcgtatcttgctaaggtaaccagtcacgatctagccaaccgtgccggtgcc 
ttttcccatggccctcgttatggtgtctttggttgggacaacaagcttgctgggacacag 
ttgcttttcactcggttgaggctgttcttgagccctcccttcccgtatgaagagatgctc 
agggtctttcatgagcaaaccagcatagtaatgtgctcctacttgccgtattacactaag 
tttaacagaacaaaaggtggtttgatcctgctgagcgaaccagagcctctccagtatgct 
gcgaatgcagctttcctggccactctgtacagtgactaccaaggcgcttctgacgctcct 
ggatggtactgtgggccaactttcttcaaaactgagatcctacgtgacttttcgacatct 
caagggagcctcgatcccgaagaacaagaaagtgacatgcgaaggaggtttgaagtggaa 
ggagagcacgagcgagaatccaaacacgatcgaaggggcaatggtcgctggaccagacaa 
gaaggatggcttccacgacgtttactcaaggaatcacttctcaacaatattgtttccact 
caaatctgttccttcactgaaggagacaaatttataatctcttctcaatattgtaacaca 
caagtggtaggaaagacttga 
(SEQ ID N0:24) 

gene join(79734. .79773,86164. .86340,87980. .87985) 

/gene= n Pept ide_13 " 

/note= n GenScan; Pl=Prom, P2=0.178, P3=PlyA" 
predictedj?eptide_13 |58_aa 

MIYRFWPKARFSGFGGKSRFPVLAGNHDFPVLAEKMFSVFAEKKFSVLAENMISLFW 
(SEQ ID NO: 25) 

CDS predicted_CDS_13 1 177_bp 

atgatttatcggttttggccgaaagcacgattttccggttttggcgggaaatcacgattt 
ccggttttggcgggaaatcacgattttccggttttagcggaaaaacgtgatttctcggtt 
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ttcgcggaaaaaaaattctcggttttagcggaaaacatgatttctctgttttggtga 
(SEQ ID NO: 26) 

gene join (88073 88112, 91231 . .91439,91774. . 91973 , 92397 .. 92435 , 

93774. .93832,93930. .94033,94874. .94993,96369. .96467, 
97170. .97545,97582. .97587) 
/gene« n Peptide_14 " 

/note= n GenScan; Pl=Prom, P2=0.466, P3=0.372, P4=0.407, 
P5«=0.188, P6*0.901, P7=0.218, P8«0.351, P9=0.547, 
P10=PlyA" 

predicted_j>eptide_14 1 401_aa 

MEWPALASELDEI AALSKEFMSLS ICAI PRTQNARADGLAKGGRTRKLI PFVSDSAPTWL 

APE^LTAVELSPHLAIDLFSSSYRAFVIIDPVLGPWIKVKPTS 

I PI WMVCLTPKVYEKP I YI VMFLPS AS S FEKKLHRLKKLMQMCRNNI LKRAPNPPSTTLI 

SYLIKLEITNTRINFTDNAFKFQRERLIAPMSHLLRRKTVMAPEKSSQPPDM 

ESQSKRERTTEINDSIERHRRLQIFSTKKTITIMVGWFKELCLENGNPEAHYIVQYFVHK 

EKQTGLFHI^QSATRNNGNN^raL^ 

KNSLSAI PVPMEQRYYVNMVNLKPHT1TCDPNKIMAKVCKQCY 
(SEQ ID NO:27) 

CDS predi ct ed_CDS__14 1 12 0 6_bp 

atggaatggccagcattggcctctgagttggatgaaattgcagcactgtctaaagagttt 
atgagcctctctatatgcgcaatcccgagaactcagaatgcccgagcggacggcttggca 
aaagggggacgtacacgcaagctcatcccgttcgttagtgactccgcacctacatggcta 
gcccctgaagctagtctaacggctgtagaactatcccctcatcttgcgatagacctattt 
tcttcttcttatcgggcctttgtaataattgacccggttcttggaccggtatggattaag 
gtgaaacctacctccaataataatcatccagtcctcggtcatcagccaggtcaggtgttt 
attccgatttggatggtatgtttgacgccgaaggtttatgaaaaacccatttatatagtg 
atgtttttgccttcggcatcgtcttttgagaagaaattgcacagattgaagaagttaatg 
cagatgtgcagaaacaacatccttaagagagctccaaatcctccatctacgacccttatc 
agttacttgatcaaacttgagatcacaaatacacgcatcaactttaccgacaacgctttc 
aagttccaaagagagagattgattgctcccatgtctcatctgttgagaagaaaaactgtc 
atggctcccgagaaatcctctcagccgccggacatggagacaaagttggatctagagcca 
gagagtcaaagcaaacgtgaaaggacaaccgagatcaatgatagtatcgagagacaccgg 
agactgcagattttttcaactaaaaaaacaataacaatcatggtcgggtggttcaaagaa 
ttgtgtttagagaacggcaatcctgaggcgcactatattgttcaatacttcgtccacaaa 
gaaaaacagacaggcctctttcatctacgccaatcagccactaggaacaatggaaataat 
atgcatctttacgttttgttaatgcttgctgaaggtcactatcaaaccggtaaaaaatat 
ttggataaactccaatggaaaaagaaacgatcaacctccgatcattgctgtgaaagaatc 
aagaactcactaagtgcaatacctgttcctatggagcagcggtactatgtaaatatggtc 
aaccttaagccacatacaaattgcgatcccaataacatggctaaagtatgcaagcaatgt 
tactaa 
(SEQ ID NO: 28) 

gene join(97759. .97798,99055. .100012,100273. .101111, 

101293. .101298) 
/gene= n Pept ide_l 5 " 

/notes "GenScan; Pi^Prom, P2=0.952, P3=0.971, P4=PlyA" 
predicted_peptide_15 1 598_aa 

MRIGVSEPEQCGCDTCVQHRTLCTQETEPSKEVTGSSVPVSSEPVQRLGSTSDQCSGTHT 
TPLAPPEPAAQSVDASSTSSSIFSSVSSQPARALCPTGSLPVPLFGCSWPRPCSCTGCSL 
LGPS IRRSSPFFTASSGSSI SSSRQANVTNSFGSAASEPSVSGPMKAPIFTSGSSTASTS 
STLPSLVTPSDITRGSVQAPVQANTSKTASDFHPPNVANTGVCAASRTSTNNPFPGFSVD 
YLPRCPSNLSRPNAPTTTPVPGPSSVLAGGETEQGSRYPRYAPTPDVDGKQIISISASNS 
HGHKSHEEIiRWEDYKNGDKAGVGSFPPPDHTPSVFTPPS I PDRPRMRTIDLTNRDTSGFP 
IGYNTPAAFQSPHEPVGVSSPASGCTACGAASSSSPSSHLGLNSTTNPPSSATSLPGLFF 
STYGSFPLLFATPNIiAAYGTTPAVQAYPMMFGI PNLAAQGTATPSVQAYPMI FGI PNLAA 
QGTTATPAFQAYPMI FGI PNVAAQGTTTTTPAAQAYPMMFGI PNLAAQGTTTPAAQPYPT 
MFGTPSLAAQGTTTAPAVQPYPWGTPNFVAQGMTPAAQAYPVNGSSLLPFAAMSLQ 
(SEQ ID NO: 29) 
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CDS predicted_CDS_15 1 1797_bp 

atgaggattggtgtttctgagccagagcaatgcggttgcgacacgtgtgtccagcatcgc 
actttatgcactcaagaaaccgaacctagcaaagaagtgactggctcatcggttcctgtt 
agttcagaaccagttcaacgtctcggttccacctcagatcagtgttccggaacacatacg 
actccactcgctcctcctgaaccagcagcgcagtctgttgatgcatcctccacatGatcc 
tccatattcagttctgttagttcccaaccagcgcgagctctttgtcccaccggttcactg 
ccagttcctttatttggttgttcatggccacgtccttgtagctgtactggttgttcactg 
ctaggtccttctattcgtcgttcatctcccttctttactgcttcttccggatcatcaatc 
tcctcgtcaaggcaagccaacgttacaaacagtttcggttccgctgcatccgaaccatct 
gtatctgggccaatgaaagctcctatttttacttctggctcttcaaccgcttccacatcc 
tcaactctaccttcattagttactccctcggacattacaagaggatcagtgcaagcgcct 
gtccaagctaacacttccaagactgcttctgattttcatccacctaacgttgccaacact 
ggagtttgcgctgcttcaaggactagcacgaacaatccatttccaggatttagtgttgat 
tacttgcccagatgtccctctaacctttctcgaccaaacgcaccaactactacaccagtt 
cctggccctagttcagttttggctggtggtgaaactgaacaaggtagtaggtatcctcgt 
tatgcgcctacaccagatgttgacggcaagcagattatttccatatctgcttccaactca 
catggacataaaagtcatgaagagttgaggtgggaagattacaaaaatggagacaaagct 
ggggttgggtcgtttcctcctcctgatcatacaccgtcggtgtttactcctccaagcata 
cctgatcgtcctcggatgagaactattgatctaacgaaccgagacacgagtggttttcct 
attggctacaacacccccgctgctttccagagcccccatgaacccgttggtgtttcttcc 
ccagcatcaggatgcacagcgtgtggagccgcgagtagctcctctccttcgagtcacttg 
ggcttgaacagtaccacaaatcctccatcatctgcgacatctcttcccgggctgttcttt 
tccacctatggttcttttcctttgctgtttgcaacaccaaatcttgcagcttatggtaca 
actccagcagtccaagcctatcctatgatgtttggaataccaaatcttgctgctcaaggt 
acagcaactccatcagttcaagcgtatcctatgatatttggaataccaaatcttgctgct 
caaggtacaacagcaactccagcttttcaagcgtatcctatgatatttggaataccaaat 
gttgctgctcaaggtacaacaacaacaactccagcggctcaagcgtatcctatgatgttt 
ggaataccaaatcttgctgctcaaggtacaacaactccagcggctcagccctatcctacg 
atgtttggaacaccaagtcttgctgctcaaggtacaacaacagctccagcagttcagccc 
tatcctacgatgtatggtacaccaaattttgtagctcaaggtatgactccagcagctcag 
gcctatcccgttaatggttcaagtcttctcccatttgccgccatgagtctgcagtaa 
(SEQ ID NO:30) 

gene join( 102918. . 102957,103375 . .104497, 104589. . 105589, 

106421. .106426) 
/ gene= 11 Pept ide_l 6 " 

/notec"GenScan; Pl=Prom, P2*=0.985, P3«0.925, P4=PlyA w 
predictedj?eptide_16 1 707_aa 

MRIDVSEPELCGCDTCVQHRTFITQETEPSKEVIGSSVPVSSEPVQPLGSTSDESSGTET 
TPLAPPPVTTPVNNPEPAAQSVGSTI PPAVTPVSSEQPAQALGSTSDQSSGTETTPIAPP 
I TTSVKSVDST IFPKFP PVQAQALAPTASGSTQAPAFGFGAFAARVPSATSGCSAFS FAP 
PVTSAPVQALGTTTTTTTTTSAAAPASPFHSSSPTTFQ 

PLDAPPSPFRWGSLQANTSPPFSFLPAQGSDKTGSAFTPPFGYPGGFARPDVGVSHPGFG 
PSNHFGPNAPTTTPVPVRS PFLAGGGTEQGSRYPRYS PTPDVDGRLIMS I SASNSHGHKS 
HEELRWEDYKNGDKGGFGWFPPVHTS PFS S PTVS PSLFAPPS I PNRPQMRTI DLTNRDMC 
GFPIGYNTPAAPQRPPEPAGVSSPASGCTACGATSRSSPSSHLGLNNTTOPPSAATSIiPG 
MFFSTYGSCPLLFGSPNLATYGTTAIPAVQAYAIMFGAPNFTSQGTTATPAFQAFPIMFG 
TPNLAAQGTTRAPAVQAYPTMFGTPNI^ 

TTPAAQAYPLMFGTPNLAAQGTTTPAVQSYPTMFGTPNLAGQSTTTTRAGQPYPTTFAVP 
QAATAPAVQPYAMMFGTPSLGAQDITPGGQAYPAHGLTLPFAAMSLQ 
(SEQ ID NO:31) 

CDS predicted_CDS_16 1 2124Jbp 

atgaggattgatgtttctgagccagagctatgcggttgcgacacttgtgtccagcatcgg 
actttcattactcaagaaaccgagccgagcaaagaagtgattggctcatcggttcctgtt 
agttccgaaccagttcaacctcttggttccacctcagatgagagttcaggaacagagacg 
actccactcgctcctcctccagtcaccacaccggttaataatcctgaaccagcagcgcag 
tctgttggctcaaccatcccacctgctgttacaccagttagttccgaacaaccagcacaa 
gctcttggttccacctcggatcaaagttccggtacagagaccactccactcgctcctcct 
atcaccacgtcggttaagtctgttgactcgaccatcttcttcaagttcccaccggtacaa 
gcacaagctcttgcccctactgcttccggttcaacgcaagcccctgcttttggttttggt 
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gcattcgctgctcgcgtaccatctgccacctccggttgttcagcatttagtttcgcccct 
cctgttacatcggcaccagtgcaagctctaggcacaaccactactactactactactaca 
tccgcggccgctcctgcatctccatttcacagttcctcaccaaccacattccaattccct 
cctgcttttacatcccttgctgcttctacttttccttctgttgcatcatcaacttcatct 
ccacttgatgctcctccctcaccatttagatggggatcactgcaagctaacacttcccca 
ccctttagcttcttgccagcgcaaggttctgacaagactggttctgcttttactccaccg 
tttggctaccctggtggttttgccagacctgatgttggtgtctctcatccagggtttggt 
ccctctaaccattttggaccaaacgcaccaactactacacctgttcctgttcgcagtcca 
tttttggctggtggtggaactgaacaaggtagtaggtatcctcgttattcacctacacca 
gatgttgacggcaggctgataatgtccatatctgcttccaactcacatggacataaaagt 
catgaagaattgaggtgggaagattacaaaaatggagacaaaggtgggtttgggtggttt 
cctcctgttcatacatctcccttttcctcaccaacggtatcaccgtcgctatttgctcct 
ccaagcatacctaatcgtcctcagatgagaactattgatctaacgaaccgagacatgtgt 
ggttttcctattggctacaacacccccgctgctttccagagaccccctgaacccgctggt 
gtttcttccccagcatctggatgcacagcgtgtggagccacgagtaggtcctctccttct 
agtcacttgggcttgaacaataccacaaatcctccatcagctgcgacatctcttcccggg 
atgttcttttctacctatggttcttgtcctttgctgtttggctcaccaaatcttgcaact 
tatggtacaacagcaattccagcagtccaagcctatgctattatgtttggggctccaaat 
tttacttctcaaggtacaacggcaactccagcttttcaagcctttcctattatgtttggg 
actccaaatcttgctgctcaaggtactacaagagctccagctgttcaagcctatcctacg 
atgtttggcacgccaaatattggagttcaaggttcaactccagcagctcaaacctatcct 
ttgatgtttggcaccccaaatcttgctgctcaaggtacaacaaatattggagctcgaggt 
acaactccagcagctcaagcctatccgttgatgtttggcaccccaaatcttgctgctcaa 
ggtacaacaactccagcagttcagtcctatcctacgatgtttggaacaccaaatctagct 
ggtcaaagtacaacaacaactcgagcaggtcagccatatcctacgacgtttgctgttcct 
caagctgcgacagctccagcagttcagccgtatgctatgatgtttggtacaccaagtctc 
ggagctcaagatatcactccaggaggtcaagcctatcccgctcatggtttaactctccca 
ttcgccgccatgagtctgcagtaa 
(SEQ ID NO:32) 

9©ne join(106892 . . 106931, 106945 . .107669,108022. .108118, 

108693. .108698) 
/gene= n Pept ide_17 " 

/note="GenScan; Pl=Prom, P2«0.909, P3=0.967, P4«PlyA" 
peptide_17 | 273_aa 

MARIITTLSTPLFFFFLFSLLSHQTISQPEHLTTFCNPSDNFTQTSSYEANRDLLLSSLR 
VS S S LGTYSNATVGRS PNTVHGMFLCRGDTTAAS CSDCVQTAT I E I ATNCTLNKEAV I YY 
E ECMVRYSNVS FFS VLEVRPS I VLYSLRS APNSNTLNETLADKFNQL I LNVS S S S IiVP YF 
LEDQELVTQAEGS YKFESMVQCS PGLDRFNCTVCLRFALLRVSTCCGS PSSALI FTPKCL 
LRKPNI WENCYDDGSFGVCTCEFVNTFDDFGLK 
(SEQ ID NO: 33) 

CDS CDS_17 ) 8 2 2 Jbp 

atggcaagaatcataactacactctctacacctctgtttttcttcttcctcttctccctg 

ctctctcaccaaaccatttctcaacccgaacacttgactactttctgcaacccttccgac 

aacttcacacaaaccagttcatacgaagcaaaccgagaccttctactctcctctctccgc 

gtcagttcctccctcggaacctattcaaacgccacagtcggtcgtagtcccaacacagtc 

cacggcatgttcctctgcagaggagacaccaccgcagcgtcttgctcagactgcgtccag 

accgctacaatcgagatcgctacaaactgtactcttaacaaagaagcggtcatatactac 

gaagagtgcatggttcggtactctaatgtttccttcttctctgttcttgaggtcagaccg 

agcatcgtcctttactctcttcgctc'tgctccaaactcgaatacgctcaatgaaacgtta 

gctgataaattcaaccaactgattctcaacgtgtcttcgtcctctttggttccgtatttt 

ctggaagatcaagaacttgtgactcaagcagagggttcttataagtttgagtcaatggtt 

cagtgtagtcctggtcttgaccggttcaactgtaccgtttgtctcagatttgcgctctta 

agagtttcaacttgttgcggttcaccaagttctgctctgatctttactcctaaatgtctt 

ttgaggaaaccaaatatttgggagaattgctatgacgatggcagttttggtgtttgcact 

tgtgaatttgtgaacacatttgatgattttggattgaaataa 

(SEQ ID NO: 34) 

9 ene join(108921. .108960,109245. .109298,109434. .109506, 

111201. .111292,111576. .111581) 
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/gene= 11 Pept ide_18 " 

/note="GenScan; Pl-Prom, P2=0.821, P3=0.759 / P4=0.101, 
P5=PlyA" 

peptide^ 18 1 72_aa 

MBIHKSRFGSTSYKRPNQISKKYNLGVRVKFEIPFEPSDLILCYIPYPESFDIQLIALYY 
HFGFCDWLTRFC 
(SEQ ID NO:35) 

CDS CDS_18|219J>p 

atggagatccacaagtctcggtttggatctacgtcctacaagagacccaatcagatttcc 
aaaaaatataatttgggtgttcgggtaaaatttgagatacctttcgagttttcggatctg 
attttgtgttacataccttatccggaatcatttgatatacaacttatcgctttgtactac 
cattttggattttgtgactggctgacccgtttctgttag 
(SEQ ID NO: 36) 

gene complement {join (111677 . . 111682 , 112729 . . 113089 , 

114128. .114711,114812. .114975,115273. .115394, 
115445 . . 115631 , 115752 . . 115914 , 116003 . . 116109 , 
116203. .116382, 116469. . 116564 , 116659 .. 116740 , 
116823. .117164,117401. . 117430 , 117506 117598 , 
117691. .117768,117908. . 117991, 118068 . . 118316, 
118664. .118762,118968. . 119044 , 119232 . . 119322 , 
122268. .122307) ) 
/gene« n Pept ide_19 " 

/note= n GenScan; Pl=Prom, P2=0.472, P3=0.544, P4=0.999, 
P5=0.979, P6=0.999, P7=0.973, P8«0.890, P9=0.924, 
P10=0.879, Pll=0.978, P12=0.991, P13«0.943, P14=0.771, 
P15=0.475, P16=0.863, P17=0.628, P18=0.998, P19=0.996, 
P2 0=0. 3 84, P21=PlyA» 
peptide_19 1 1062_aa 

MSGVYAPSDVTSFLSFDGSESRSSLDGSKKGHQNLVEWLNQTLPYLNLPSEASEDEGGSF 

EPAYVKVERFLTA^EMALPRFEVSDIEQKVKMQGDMLPVFQSLKALKASFSDGGNDKN 

LGARRRWSLPEDHSDSRGDDRNFIDGFQSKEGFEIDTSDAKISELLKSDSIiRNAPTRTLF 

DMLDKLLDESVKKMNGHVSHAMASLLSALVQVIEQRISNQADl^KNQNILFRVREDKYRS 

RI KVLETLAAGATQENE I VSNCMERTKLEKSRI EEREKSEEKD WRLKKEKERSDAE I RK 

LKQELKWKEAHANQCLELEAKAQNSTVELESKLKDAEL^ 

KWENRE STYQSF IDNQFGALQALNATS VS I KQEVLRTQKKYFEDLNYYGLKLKGVADAAK 

NYHWLEENRRLYNEVQELKGNI RVYCRIRPFLPGQNSGQTS I EYI GENGEL WANPFKQ 

GKDTHRLFKFNKVFGQAATQEEVFLDTRPLIRSILDGYNVCIFAYGQTC 

I TS KEHWGVNYRALNDL FHLTQLRQNTVVYEVGVQMVE I YNEQ VRD I L S DENFLNLRTLG 

VWNTALPNGLAVPDASMHS WSTEDVLELMNIGLMNRTVGATALNERSS RSH^ I EMTLL 

PIVYSVLSVHTOGVDVETDSVLRGSLHLVDLAGRGQAKTLMFVQVNPDGDSYAETV 

F AER VS GVE LGAAKSNKEGRD VRHLMEQ VS S LKD VI AKKDEELQNVQKQ KSNSTTVP KRG 

LSNLRLLGPSSPRRHSIGPSPNARRGKAPGSFGRAASDFDNCSEYSSKHSDSGSPRSSDE 

LKHRKDLHQLSKFAGGSKEIDIEDDIELIGLGDADSEDRLSDISDSCLSMGTETDGSICS 

AVELTLF PETVKPLE ITEEPE PHLVPEKLEKSAKMVKTVPKDKTRTWS YQLFAS F YGPDV 

PKTFISHLRRQLNSKGILMFDDQGTERGQAIRGSVISIVVLSKNYASSSWCLKNLLEILK 

CKEQI VTTVFYGVDPSDVRKQTGDFGKAFKETCRGS KDGDKL 

(SEQ ID NO: 37) 

CDS CDS_19|3189_bp 

atgagtggcgtctatgctccctccgatgtaactagttttctcagcttcgacgggtctgaa 
agtcgttcaagcttggatggtagtaagaaaggtcatcagaatttggtggaatggttaaat 
cagacacttccttatttgaatttaccatcggaagcttcggaggacgaaggag;gcagcttt 
gagcctgcttatgttagagttgagcggtttctgactgctatggatgaaatggccctgccc 
agattcgaggtttcagacatagaacagaaagtaaaaatgcagggggatatgctgccagtt 
tttcagtcccttaaggcccttaaagcaagtttttccgatggtggtaatgataaaaactca 
ctaggtgcgaggaggagatggagcttgccagaagaccattcggattccagaggagatgac 
cgcaactttattgatggattccagtcgaaggaaggatttgagattgatacatcagatgct 
aaaatttcagaattactgaaatctgacagtttacgaaatgctcctactcggacactattt 
gacatgctggataaacttctagatgagagcgtgaagaagatgaatggacatgtgtctcat 
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gcaatggcatcactcttgagcgcacttgtgcaagtgatagaacagagaatctcaaatcaa 

gctgataacctgaaaaaccaaaatatactcttcagggtacgtgaagataaatacaggtca 

agaataaaggtcttagaaaccttggcagctggggcaactcaggaaaacgagattgtttcg 

aattgtatggagcgtacaaagcttgaaaaaagcagaatagaagaaagagaaaagtcagaa 

gaaaaagatgtggtgcgtttgaaaaaggaaaaagagcgcagtgatgctgagattcgtaag 

ctgaagcaagaactcaaggtggtgaaagaggcgcatgcaaaccagtgcttggagttagaa 

gcaaaagcacaaaacagtacagttgagttggagagtaaattaaaggatgcagagttacaa 

gttgccgaatcaactaggaaggtcaaagaactcgagaagttgtacctatctaaatctcaa 

aaatgggagaacagagagtccacctaccaaagcttcatagacaaccagtttggtgctttg 

caggctttgaatgctacttcagtgtctataaagcaagaagtcttaaggacacaaaagaaa 

tactttgaggacctaaattactatggtttaaagctcaaaggagtggctgatgcagcaaaa 

aattaccatgtggtccttgaagaaaaccgaagactgtacaatgaagtgcaggaattgaaa 

ggaaatatcagagtctattgccggataagaccattccttccggggcaaaacagtggacag 

acttctatagagtacattggtgagaacggtgaattggtggttgcaaatccgtttaagcaa 

gggaaagatacccatcggttgtttaagttcaataaagttttcggtcaagcagcaactcaa 

gaggaggttttcctagatactcgaccattaattcgatcaattcttgatggttataatgtg 

tgtatatttgcgtatggtcagacgggatctggaaaaacttatacaatgagtgggccaagc 

atcacttcaaaagaacactggggtgtcaattacagagctctgaatgacttgtttcactta 

actcagcttagacaaaacactgttgtgtatgaagtaggtgttcaaatggttgagatatac 

aatgagcaagttcgtgacatactttctgatgaaaatttccttaacttgcgtacattaggg 

gtttggaatactgccttaccaaatgggttagctgtcccagatgcaagcatgcattctgtg 

agatcaactgaagatgtgcttgagctgatgaatattgggctcatgaacagaaccgttggt 

gccacagctctcaatgaaaggagtagtagatcacactggattatagaaatgacattgttg 

cctattgtatacagcgttctttctgttcatgtacgtggtgtcgacgtggaaaccgattct 

gttttgcgtggtagtttgcacttggtcgatcttgctggaagaggacaggctaagactctt 

atgtttgttcaagtcaatcctgatggagattcttatgctgagacggttagcactctgaag 

ttcgctgaaagagtttctggtgtggaattaggtgcagctaaaagtaataaagagggacga 

gatgttagacacctcatggaacaggtatcaagcttgaaggatgttattgccaagaaagat 

gaagagcttcaaaatgttcagaagcaaaaaagtaacagtacaaccgtgccgaaacgtggt 

ttaagcaatctaagattgttggggccttcatcacctagaagacactctataggaccttca 

ccaaatgctcgacgaggaaaggcacctggttcttttgggagagcagcctcagattttgac 

aactgctcagaatacagtagcaagcattctgattccggttcaccgcgttcatcggacgaa 

cttaaacatcgaaaggatcttcaccagctatctaagtttgccggtgggtcaaaagaaatt 

gacattgaagatgacattgaactcattggccttggggatgcagattctgaggacagattg 

agtgatatctctgatagctgtctttcgatgggaacagaaactgatggctccatatgcagt 

gcagtcgagttgactcttttccctgaaaccgtgaagcctcttgaaataacggaagaacct 

gaaccacacttggtccctgagaagctcgagaaatcagcaaagatggtgaaaaccgtgccc 

aaagacaagactcgcacttggagctaccaactctttgcgagtttctacggacccgacgtc 

cctaagacttttatcagtcatttgcggagacagttaaacagcaaggggattttaatgttc 

gatgatcaagggactgagagaggccaagctattaggggatcagtgatatcgatcgtggtg 

ctttccaagaattatgcttcttccagctggtgtttgaaaaacctgctggagattttgaag 

tgcaaggaacaaatcgtgacgacagttttctacggagtagatccatccgatgttaggaaa 

cagactggagatttcgggaaagctttcaaggaaacatgtcgtggtagcaaagatggagac 

aagctttaa 

(SEQ ID NO:38) 

gene complement (join (122358. .122363,124018.-124089, 

126272. .126370,131256. .131351,132828. .132830, 
133476. .133515)) 
/gene= n Pept ide_2 0 " 

/note="GenScan; Pl-Prom, P2=0.712, P3=0.749, P4=:0.080, 

P5=0.083, P6=PlyA« 

peptideJ20|89_aa 

MIRDFAADKVALRAS PTRNFILRANDPQNEGLKSSKLWKLRYPARYPTS SASKSHKRTK 
THLI KLTQRQRS ALKKIQRKQLFKASSFD 
(SEQ ID NO; 39) 

CDS CDS_20|270Jbp 

atgatacgtgattttgctgccgataaagttgctctaagagcatctccaacaaggaacttc 
attttaagggccaatgatcctcaaaatgagggtttgaagagctctaaacttgtcgtcaaa 
ctcagatatccggcgagatacccgacatcatcggcgagcaaatctcacaaaagaacaaaa 
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acccatcttattaaattgacgcagcggcagcgttcggcgttaaaaaaaatacaaaggaaa 
cagctttttaaggcatcctcgtttgattga 
(SEQ ID NO: 40) 

gene join (134673. .134712,136050. .136161,136634. .136739, 

136919. .137191,138414. .138510,138601. .138672, 
139048 . .139809,139955. .140470,141213. .141218) 
/gene= " Pept ide_2 1 " 

/note="GenScan7 Pl=Prom, P2«0.008, P3=0.006, P4=0.279, 
P5=0.999, P6-0.913, P7=0.935, P8=0.697, P9=*PlyA" 
peptide_21 1 645_aa 

MRRRERRASGGSGGNGGGATGERTSTGGYSYGSVSA6ENRMMPENLKKHLAVSVRNIQWS 
YGI FWSVSASQPGLLEWGDGYYNGD I KTRKTVQASEVKADQLGLERSEQLRELYESLSLA 
ESSTSCGSQVNRRASASSLSPEDLTDTEWYYLVCMSPVFNIGEGVPGGVLANGQPIWLCN 
AHTADSKN^TRSLLAKSASLLTWCFPFLGGVLEIGTTE 

GTLSTRSDYQEIFDPLSHDKYIPVFGTEAFPTTSTSVYEQEPDDHDSFINGGGASQVQSW 
QFVGEELSNCVHQPLNSSDCVSQTFVGTTGRVTCGPRKS RNQRLDQI QEQNNRVNMDDDV 
HYQGVI STI FKTTHQLVLGPQFQNFDKRS S FTRWRRSSLSAKTLGEKSQNMLKKI I S EVP 
RMHQKKALLPDTPEDSGFKVGDETANHALS ERKRREKLNDRF ITLRSMI PS I SKTDKVS I 
LDDTIEYLQELQRRVQELESCRESDGKEMRMAMKRKKMEDEDERVSANCLK^ 
VNVEEDEPADTGYAGLTDNLRIGSFGNEWI ELRCAWREGIIiLEIMDVI SDLNLDSHSVQ 
SSTGDGLLCLTVNCKVQLLTQAKFLLPNRNLSKPKLIYILMLIAA 
(SEQ ID NO: 41) 

CDS CDS__21 1 1938_bp 

atgcgaagaagagaaaggagagcgtctggagggtctggaggtaacggcggaggagcgacg 
ggagagagaacttcaaccggaggatactcgtacggcagcgtatctgcaggagaaaacaga 
atgatgccggaaaatctaaagaagcacctcgctgtttcagttcgaaacattcaatggagt 
tacggaatcttttggtctgtctctgcttctcaaccaggactgttggagtggggagatgga 
tactacaatggagacattaagactaggaagacggttcaagcatcggaagtcaaagctgac 
cagttgggtcttgagagaagtgagcagcttagagagctttacgaatctctctccctagcg 
gagtcttcaacctcctgtggttctcaggtcaatagacgggcttccgcctcctctttgtct 
ccggaagatctcaccgacactgagtggtattacttagtatgcatgtctttcgtcttcaac 
attggtgaaggagttcccggaggagtgttggcgaacgggcaaccaatatggttatgtaac 
gctcataccgccgatagtaaagtcttcactcgctctcttcttgctaaaagtgcttcgctt 
ttgacagtggtttgcttcccatttcttggaggagttcttgagatcggcacgaccgaacat 
gttgcagagaacttaaacgtgatacaatgcgtgaagacattgttccttgaagctcctcat 
ggaactttatcaacgagatccgattatcaagaaattttcgatcctttaagcoacgataaa 
tacattccagtgtttggaactgaagcttttccgacaacttccacaagcgtgtatgagcaa 
gaaccagatgatcatgattcgttcatcaacggtggtggtgcatcccaggtacaaagctgg 
cagtttgtgggtgaagaactcagtaattgcgttcaccaaccgcttaattctagcgattgc 
gtttcccagacttttgttggaacaaccgggagagttacttgcggtccaaggaagagtagg 
aatcaacggttagatcagattcaagaacagaataaccgagtaaatatggacgacgatgtt 
cattaccaaggcgtgatctcgacgattttcaaaacaacgcatcagctagttcttggaccg 
cagtttcagaactttgataagcggtctagtttcacacggtggaggaggtcatcattgtct 
gcaaaaacgttgggagagaagtcgcaaaatatgttaaagaagattatatccgaggttcct 
cggatgcaccaaaagaaggcgttgttaccagacacaccagaagatagcgggtttaaggtt 
ggggatgaaaccgcgaaccacgccttgtccgagaggaaacgccgcgagaaattgaatgat 
cggttcataacgttgagatcaatgattccttcaattagtaagaccgataaggtgtcgatt 
cttgatgatacgatcgagtatcttcaagaacttcaaagacgggttcaagaattggaatct 
tgcagagaatctgacggtaaagaaatgcgaatggctatgaaaaggaagaaaatggaggat 
gaagatgaaagagtatcggcgaattgtttgaaaagcaagaggaaggagagtgagagtgat 
gtgaatgttgaagaagatgaaccggctgataccggttatgctggtctaacagataattta 
aggatcggttcgtttggcaatgaggtggttattgagcttaggtgtgcttggagagaaggg 
atattgcttgagataatggatgtcattagtgatctcaatttggactctcactcggtacag 
tcttcaaccggggatgggttattgtgtttaactgtcaattgcaaggtacagctactaaca 
caagctaaatttctgttacctaaccggaatttgagtaaaccgaaactgatatatatttta 
atgctgattgcagcataa 
(SEQ ID NO:42) 

Sene complement (join (142728 . .142733,143177. .143658, 
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144696. .145220,145303. .145399, 145477. .146472, 
146553 . . 146620 , 146700 . . 146889 , 147267 . . 147306) ) 
/gene= " Pept ide_22 n 

/note="GenScan; Pl=Prom, P2=0.971, P3=0.999, P4=0.739, 
P5=0.946, P6=0.078, P7=0.105, P8«PlyA" 
peptide_22 | 785_aa 

MEKRRSPRSSKSPGTPIFPKSPLVYESYKSGCGWKLINFFDFRHVKSGNKRLSSQKKPIR 
DSAGNVYTKSQLDLLKRLHERCQCHDRI VEGENSCKPKTRRRS FS S EREDES YESKPVQG 
LLEREIKRIKNAKEETSLSEVEKAKQMDLKNGRDCKKKSSEINLQVCVNEAAET^ 
EEKGKDRSKQFMEALDILSSNKELFITLLQDPNSFSAKKGQDLERPHVKERRDKSPSLAD 
DLDEIVLLKPRLPSLVDDRKYLRFKm^ 

VFRRKKRVE SDVFKLS I EKDVS PRRFTVERQQERSDS S PVYEVPKALS S LQTKL KERRER 
. LEKRRESFiaWSLDKEDLEVFDPNPHSYNVRSLSEKTSLRAPVEDGLEEDRYLESSSAES 
SIKRQEQEQPSPVSVLERIHMLDETVGLRNKEQIGLLSFDLVEKDSVHEFVKQLLQASRL 
NWTNLMARCNEETSLLDEFSHGNHNNDQLLLVLDYTDEILREIYRQDIKFW 
VVtf\n?ASFREBDLIHETLRRFDWSLLCCDSPKALDQV^ 

SDWENIMQGLWEI SHELRTMQI RLQVSHRFCVVYG I VPRGPLDLSTAPDKTRLHGEAI 
DFITNLQDVHKLAQSHI^SATTKYKLAADTKRCELIFEPGDLVWVYLTK^ 
KSKKLGPVEWERINPNVYRVRLPSHLRTSDVFNI KHLS PFKGDNDDPDSWANPS QPGGP 
DAAAS 

(SEQ ID NO;43) 

CDS CDS_22|2358_bp 

atggagaaaaggagatcaccaagaagcagcaagagtcctggaactccaattttccctaaa 

agccctcttgtctacgagagttacaagtctggttgtggttggaaactgattaacttcttc 

gatttcagacatgttaaatctggtaataagagactgagttcacagaagaaacccatcaga 

gactctgctggtaatgtttatactaaaagccaacttgatttgctcaagagacttcatgag 

agatgtcagtgtcatgatcggattgtggaaggagagaactcatgtaagcctaaaacgagg 

agaagatcctttagttctgaaagagaagacgagagctatgaatcaaaacctgttcagggt 

ttactagagagagagataaagaggatcaagaatgcaaaggaggaaacttctttgagtgaa 

gttgagaaggctaagcaaatggatctgaagaatgggagagattgtaaaaagaagagctca 

gagattaaccttcaagtttgtgtgaatgaagctgctgaaacattgatcagttccaaggca 

gaggagaaaggaaaagaccggtccaagcagttcatggaagcattagatattctaagctcg 

aacaaagagttgttcattacactcttacaagatcctaactcgttctcagccaaaaaaggt 

caagacttggagagacctcatgtgaaggaacggcgtgataaatctccttcactggctgat 

gatttggatgagatcgttctcttgaagcctagactaccaagcttggttgacgacagaaag 

tacttgagattcaaacatctaaccaagaagttgaagcttgttgttggatctaacaaagac 

accgaggcttctggcgatgtttctagcagtgcagttggctatagaagcccggagt caeca 

gtgtttaggcgcaagaaacgtgttgagtctgatgtcttcaagctaagcattgagaaggat 

gtttcgccaaggaggttcacagtggaaaggcaacaggagagatcagattcttcaccggtt 

tatgaagttcctaaagcactgagtagcttacagacaaaactcaaagagagaagagagagg 

ctggagaagagaagagagagcttcaagttgtggtccttggacaaggaggacttggaggtt 

ttcgatccaaatccgcacagctacaatgtaaggtctctcagtgagaaaacaagcctgaga 

gctcctgtagaagacggactagaggaagatagatatctagagagctcatcagctgagtcg 

agtataaaaagacaggaacaagagcaaccgagtcctgtctctgtactagagaggattcat 

atgttagatgaaacagttggcctcagaaacaaagagcagattggactgttatcttttgat 

cttgtcgagaaagactctgttcacgagtttgtgaagcaacttctccaagcttcaagactg 

aactggaccaatcttatggcgagatgcaacgaagaaacatcactactagacgaattctca 

cacggtaatcacaacaacgaccagcttcttcttgttcttgactacacagacgaaattctc 

cgcgagatttaccgtcaagacatcaagttttggcctttcaagccgtctcagagctcgaga 

gtcgtcaatgtaccagcttcctttagagaagaagatctgatacatgagacgttgagacgt 

tttgactggagtttactctgctgtgactctccaaaagcattggatcaggtcgttgaggct 

gatctgataaagccatcatgtctttggttggattgtggtggtgaagctgaaggtgtagtc 

tctgatgtcgttgaaaatattatgcaagggttggtggttgagatatctcatgagctaaga 

acaatgcagattaggttacaggtttctcaccgtttttgtgttgtgtatggtattgttcct 

cgtggtccgttggatctttcgactgcaccagacaaaactcgccttcatggagaggccatt 

gactttattaccaaccttcaagacgtccataagctcgcgcagtctcatttggaatcggct 

acgactaaatacaaacttgcagcagacaccaagcgttgtgagctgatcttcgaaccaggt 

gatctcgtttgggtgtacttaactaaagaacgactccccttgcgtgattacaataaattg 

aagtctaaaaagcttggtccagtcgaagtggtggaacgcatcaatccgaatgtgtatcgt 

gtcaggttgccttcacaccttcgtacgtctgacgtcttcaacatcaagcatttgtctccg 
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ttcaaaggcgacaatgatgatccagattcgtgggcaaatccttctcaaccgggaggacct 
gatgcagcagcatcataa 
(SEQ ID NO: 44) 

gene join (147355. .147394,147557. .147628,149555. .149586, 

150051. .150111,155933. .157925,158007. .158123, 
158202. .158377,158531. .158676,158813. .158831, 
159102. .159107) 
/gene="Peptide_23 " 

/note= M GenScan; Pl^Prom, P2=0.591, P3=0.063, P4=0.049, 
P5=0.902, P6«0.998, P7=0.911, P8=0.987, P9=0.974, 
P10=PlyA" 
peptide_23 | 871_aa 

MGPLTIKEAFSLVLEKEGEEEEDEIFRILGSSDRRNIDTIRIFEGPRFGSGPDISWAIIjV 
GLVI ITPNPPRITNTR13APYCRLATKKNS YPLRKTFLHHTPS PVSPMAFTFFSPHPVFLS 
LGRTTSSFSSYKPAYSPFSRNPRNNLQIAAGPTRRSSYPNPADDDPPEAPEDSMHGVSKF 
QQIQRQAARARKLEEEDFEKNRNTYLSAIADVEDAPETGRDDVESGGDLFSDIDRAISMK 
RSEFVKKGLLQPNPPKTASSKKIDEEEEEDDAVDELDEEEAVDLDEIDKLTGLTEASDEE 
DWVDEEGNPRI I SKKKEHQFEFDLDDFGESKARI VEPKFRLSLAELLDESKVVPI S VYGD 
LDVEITGIQHDSRGVSAGDLFVCCDGGDDSVLSEADKRGAVAVVASKEIDIEDTLGCRAL 
VI VEDTEAVLAALAS S FYRHPS KDMAVI GVTGTNGKTTTT YL I KS LYEAMGVRTGMF S S V 
SCYVHGDNKMDSTTTS PDAVLVQSMMAKMLHNGTEALVMEAS PQELASGKCDEVDFD I AV 
FTNLAREDSGFRGTDEEYRDAEAKIjFARMVDPERHRKVVNIDD 

TFAMENTKADVHPLKFELS IiFETQVLLNTPQGI LEI S SGLLGRHNI YNILAAVAVGI AVG 

APLEDIVRGVEEVDAVPGRCELIDEEQAFGVIVDHANTPDGLSRLLDSVRELKPRRIITV 

I GCAGETERGKRPVMTKI ATEKSDVTMLTSDNPGNEDPLD I LDDMLAGIGWTMQE YLKHG 

EHDYYPPLANGHRLFLHDIRRVAVRCAVAMGEEGDMVWAGKGHEAYQLEGDKKEFYDDR 

EECREALQYVDELHQAGIDTSEFPWRLPESH 

(SEQ ID NO: 45) 

CDS CDS_23|2616_bp 

atgggtcctctcaccattaaagaggcttttagtctcgtcctggagaaggagggagaggag 
gaggaagatgaaatatttcgaattttaggtagttcggatagaagaaatatagacaccata 
cggatatttgaaggtcctcggttcggttcgggtccggatatttcgtgggctattttggtg 
ggcttggtcattattacgccaaacccacctcgtatcactaacactcgaaacgcaccgtat 
tgtcgtttggcaacaaaaaaaaactcttatccccttcgaaaaacctttcttcatcacact 
ccctctcctgtctctccaatggccttcactttcttctctcctcaccccgtcttcctctct 
ctaggtcgaaccacctcttccttctcctcctacaaaccggcatactcaccattctcccga 
aatccccgtaataatcttcagttagcggcgggccccacccgtcgtagctcttacccgaac 
ccggcggatgacgacccgcccgaagccccggaggattcgatgcacggcgtctccaagttt 
cagcagatacagcgccaagccgctcgagcgcggaagctggaggaagaagacttcgagaag 
aaccgaaacacgtacctctcggccatcgctgacgtggaagacgcgcccgagacgggacgc 
gatgacgtggagtctggaggcgatctgttctcggacatcgatagagccatctcaatgaaa 
cgtagcgagttcgttaaaaaaggactgctccaacctaaccctcccaaaacggcgtcgtct 
aagaagatcgacgaggaagaagaagaagacgacgctgttgatgagcttgacgaagaagaa 
gctgtggatttagacgagatcgataaactgactggattaaccgaagcttccgacgaagaa 
gactgggtcgacgaggaagggaaccctaggatcatcagcaagaagaaggagcatcaattc 
gagttcgatttggatgatttcggcgaatccaaggcgagaatcgtggagcctaagttcaga 
ctgagcttagccgagctcttggacgagagcaaagtggtgccgatctcagtctacggcgac 
ttagacgtcgagatcaccggaatccagcacgattcgcgaggcgtaagcgccggagatctc 
ttcgtgtgctgcgacggaggagacgactccgtcctgagcgaagctgacaagagaggagcg 
gtggcggttgtggctagcaaagagatcgatattgaagatacgttaggctgtagagcgctc 
gtcatcgtcgaagacaccgaagcagtcttggctgcgttagcttcttcgttttataggcat 
ccgtcgaaggacatggcggttatcggagtcacggggactaacgggaagacgaccaccacg 
tatttgattaaaagcctctatgaagctatgggtgtgagaacaggaatgttcagcagtgtt 
tcttgttacgtccatggagataacaagatggattcaacgacgacgagtcctgatgctgtt 
ttggttcagagtatgatggcgaagatgttgcataatggaaccgaagctctggttatggaa 
gcttctcctcaagaactcgcttcagggaaatgcgacgaagttgatttcgacattgcggtc 
ttcacgaatttagccagagaggatagtggctttcgcggtactgatgaggagtatagagat 
gctgaagccaagttgtttgcaagaatggtcgacccggaaagacacaggaaagtggttaac 
attgacgatccaaacgcagcgtttttcgtccagcaagggaaccctgatgttcctgttgtg 
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acgtttgcaatggagaacacgaaagcggatgttcacccgttgaagtttgagctgtctttg 
tttgagacacaggttttgctcaatacacctcagggtatactggaaatttcgtctggtttg 
ttaggacggcataacatttataacattcttgctgctgttgctgtcgggatagctgtcgga 
gc t cct ct t gaggat at t gt t agaggt gt t gaggaagt cgat gctgt cccggggaggtgt 
gagttgattgatgaggaacaagcttttggtgttattgtggatcatgctaacacacctgat 
ggtttgtcaaggctgcttgattcggttcgagagcttaagccaagaagaatcattactgtt 
attggctgtgcgggtgagactgagagagggaaacgaccggttatgacgaaaatcgcaact 
gaaaagagtgatgtgacgatgttgacatctgataatccggggaatgaagatccattggac 
atattggatgacatgttggctgggattggatggacgatgcaagagtatctgaaacacgga 
gaacatgattactatcctccattggcaaatggtcatagactcttccttcacgacattaga 
cgtgtagctgtgcgttgtgctgttgcgatgggtgaagaaggtgacatggttgtggtagca 
gggaaaggccacgaagcgtatcagcttgaaggtgataagaaagagttctatgatgatcga 
gaggaatgtcgggaagcattacaatacgttgatgagcttcatcaagctggaatagacaca 
agcgagttcccatggaggttaccagagagtcattaa 
(SEQ ID NO:46) 

gene complement (join(159434 . .159439,159685. . 159707, 

159839. .161876,161980. .162019) ) 
/gene= " Pept ide_24 " 

/note= n GenScan; Pl=Prom, P2=0.887, P3=0.985, P4»PlyA" 
peptide_24 1 686_aa 

MLARVCRFES SS S SS VSAARFFCTGS I RHALAEKSRDGESGEAGFRGESLKLRSGS YE I K 
GLEDAI DLFSDMLRSRPLPSVI DFNKLMGAWRMERPDLVI SLYQKMERKQI RCD IYS FT 
ILIKCFCSCSKLPFALSTFGKLTKLGLHPDVVTFTTLLHGLCLDHRVSEALDLFHQICRP 
DVLTFTTLMNGLCREGRVVEAVALLDRMVENGLQPDQITYGTFVDGMCKMGDTVSAI^ 
RKMEEISHIKPNWIYSAIIDGLCKDGRHSDSHNLFIEMQDKGIFPNIVTYNCMIGGFCI 
SGRWSAAQRLLQEMLERKI SPNWTYNALINAFVKEGKFFEAAELYDEMLPRGI I PNTIT 
YNSMIDGFCKQDRLDAAEDMFYLMATKGCS PDVFTFTTL IDGYCGAKRIDDGMELLHEMP 
RRGLVANTVTYOTLIHGFCLVGDLNAALDLSQ 

DALEMFKAMQKSKMDLDASHPFNGVEPDVLTYNI LI CGLINEGKFLEAEELYEEMPHRGI 
VPDTITYSSMIDGLCKQSRLDEATQMFVSMGSKS FS PNWTFNTL I NGYCKAGRVDDGLE 
LFCEMGRRGI VADAI I YITLI YGFRKVGNINGALDI FQEM I S SGVYPDTIT I RNMLTGFW 
S KEELERAVAMLEDLQMS VGYQLEDE 
(SEQ ID NO: 47) 

CDS CDS_24|2061_bp 

atgttggctagggtttgcagattcgagtcttcctcttcgtcttctgtgtctgcggctaga 
tttttctgtacgggatcgattcgtcatgctctggccgagaaaagcagggatggagagagt 
ggcgaagcaggttttagaggagagagtttgaaactgcgaagtggatcttatgaaatcaaa 
gggttagaggatgcgattgatttgttcagtgacatgcttcgatctcgtcctttaccttct 
gtgattgatttcaacaagctaatgggtgcggtggtgagaatggaacgcccggatcttgtg 
atttctctctatcaaaagatggaaaggaaacagattcgatgtgatatatacagcttcacc 
attctgataaaatgtttctgcagttgctctaagctcccctttgctttgtctacatttggt 
aagctcaccaagcttggactccaccctgatgttgttaccttcaccaccctgctccacgga 
ttatgtcttgatcacagggtttctgaagccttggatttgtttcatcaaatttgtagacca 
gatgtcctaacgttcaccacgctgatgaatggtctttgccgcgagggtcgagttgtcgaa 
gccgtagctctgcttgatcggatggtggaaaatggtctccagcctgaccagattacttac 
ggaacatttgtagatgggatgtgtaagatgggcgacactgtgtctgcattgaatcttctg 
aggaagatggaggagataagccacatcaaacccaatgtggttatctatagtgccatcatt 
gatggcctttgtaaagatggacgccatagcgattctcataatcttttcattgaaatgcaa 
gacaagggaatctttccaaatatagttacctacaactgtatgatcggtggattttgcatc 
tctggtagatggagtgcagcccagcggttgttgcaagaaatgttagaaaggaagatcagc 
cctaatgttgtaacttataatgctttgatcaatgcatttgtcaaggaaggcaagttcttc 
gaggctgcagaattatacgatgagatgcttccaaggggtatcattcctaatacaatcaca 
tataattcaatgatcgatgggttttgcaaacaggatcgtcttgatgctgctgaggacatg 
ttttatttgatggctaccaagggctgctctccggacgtattcactttcactactctcata 
gacggatattgtggggctaagaggatagatgatggaatggaacttctccatgagatgcct 
agaagaggattagttgctaacacagttacttacaacactcttattcacgggttctgtctg 
gtgggcgatcttaatgctgctctagacctttcacagcagatgatttctagtggtgtgtgc 
cctgatatcgttacttgtaacactttgctggacggtctctgcgataatgggaaactaaaa 
gatgcattggaaatgtttaaggctatgcagaagagtaagatggatcttgatgctagtcac 
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cccttcaatggtgtggaacctgatgttctaacttacaatatattgatctgcggcttgatc 
aatgaagggaagtttttagaggccgaggaattatacgaggagatgccacacagaggtata 
gtcccagatactatcacctatagctcaatgatcgatggactatgcaagcagagccgccta 
gatgaggctacacaaatgtttgtttcgatgggtagcaagagcttctctcccaacgtagtg 
acatttaacacactcattaatggctactgtaaggcaggaagggttgatgatgggctggag 
cttttctgcgagatgggtcgaagagggatagttgctgatgcaattatttacatcactttg 
atttatggttttcgtaaagtgggtaatattaatggggctctagacattttccaggagatg 
atttcaagtggtgtgtatcctgataccattactatccgcaatatgctgactggtttttgg 
agtaaagaggaactagaaagggcagtggcaatgcttgaggatctgcagatgagtgtgggg 
tatcagttggaggatgaatga 
(SEQ ID NO:48) 

gene join (162193. ,162232,163044. .163095,164032. .164606, 

165002. .165007) 
/ gene= 11 Pept ide_2 5 " 

/note= "GenScan; Pl«=Prom, P2«0.976, P3=0.992, P4*PlyA" 
peptide_25 | 208_aa 

MNLGQNQPRTRSYSTLIiAQTNQNPSATECSGCQPPRRSNIiGRIKIRVHIIiVIiERHASRRP 

AHHHRFTLPPGNHTSVHRNPTGAHKLHTSSSRTL^ 

HPRBRFTCITNLPAGDLHPPPTRPRSTKPKRTTKRRKEKKKN^ 

ITPRRLEPMTVEPKRLPRRGYEGRRRWS 

(SEQ ID NO: 49) 

CDS CDS_25|627_bp 

atgaaccttggccagaatcaacctcgtacccggtcttactcaactcttttagcccaaaca 
aaccaaaaccctagcgctacggaatgctcgggatgtcagccgccgcgtagatctaacctt 
gggagaatcaagatacgtgttcacatcctcgtcctcgagagacacgcgtcacgcaggcca 
gcacaccaccaccgcttcacgctaccaccggggaaccacacctcagttcaccggaaccca 
accggagctcacaaactccacacatcctcatcgcgaacactttttcgatctaggatggct 
ctaatcggagagagcatcaatcgtcgctacgagatctcatcccccgtagtttatcgcaca 
caccctagagaacgcttcacatgcatcaccaaccttcccgccggagaccttcaccctcct 
ccgacgagaccacggtcaaccaaacccaaacgaaccacaaagagaagaaaagaaaaaaaa 
aaaaacctaaaccctaataaactaggagccgaaagccggcggcgcaagacagtggagatc 
atcacaccccggagactagaaccgatgacggtggaaccgaagaggcttccccgtcgcgga 
tacgaaggccggcgtcgatggagctaa 
(SEQ ID NO: 50) 

gene complement (join (166863 . .166868,167079. .167197, 

167582. .169685,170895. .171030,173614. .173669, 
173934. .173973) ) 
/gene= " Pept ide_26 " 

/note="GenScan; Pl=Prom, P2=0.664, P3=0.657, P4=0.831, 
P5=0.314, P6ePlyA" 
peptide_26 | 804_aa 

MEAPNYPIFFGLNLGVPLEGGRSGTYSGPGSSLFGFRIPGVKDPSPIRLFLNYGSGSVRI 
LADS SRVFFRDRRRTKFRRNKNKMLARVCGFKCSS SPAESAARLFCTRS IRDTLAKASGE 
SCEAGFGGESLKLQSGFHEIKGLEDAIDLFSDMLRSRPLPSVVDFCKLMGVVVRMERPDL 
VISLYQKMERKQIRCDIYSFNILIKCFCSCSKLPFALSTFGKITKLGLHPDVVTFTTLLH 
GLCVEDRVSEALDFFHQMFETTOTPNVVTFTTLMNGL 

TQI TYGTIVDGMCKKGDTVSALNLLRKMEEVSHI I PNWI YSAI I DSLCKDGRHSDAQNL 
FTEMQEKGI FPDLFTYNSMI VGFCSSGRWSDAEQLLQEMLERKI SPDVVTYNAL INAFVK 
EGKFFEAEELYDEMLPRGIIPNTITYSSMIDGFCKQNRLDAAEHMFYLMATKGCSPNLIT 
FNTLIDGYCGAKRIDDGMELLHEM'ETGLV^ 

SSGLCPDIVTCDTLLDGLCDNGKLKDALEMFK^QKSKKDLDASHPFN 
ISGLINEGKFLEAEELYEEMPHRGIVPDTITYSSMIDGLCKQSRLDEATQMFDSMGSKSF 
SPNVVTFTTLINGYCKAGRVDDGLELFCEMGRRGIVANAITYITLICGFRKVGNINGALD 
IFQEMISSGVYPDTITIRNmiTGLWSKEELKRAVAMLEKLQMSMVYYWSELKRHTFQKIS 
GVKRCLGVCPFCSCHHGYRQARSS 
(SEQ ID NO: 51) 

CDS CDS_26|2415_bp 
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atggaggcaccaaattatcctatattttttggacttaatcttggtgtacccctagagggt 

gggcgttcgggtacctattcgggtttcggttcgagtctattcggatttcggatttttggg 

gtcaaagattttagccccattcggttatttctaaattacggttcgggttcggttcggatc 

cttgcggattcgtcacgagttttttttagagatcgacgaagaacaaaatttaggcgaaac 

aaaaataaaatgttggctagggtttgtggattcaagtgttcttcttctcctgctgagtct 

gcggctagattgttctgtacgagatcgattcgtgatactctggccaaggcaagcggagag 

agt tgcgaagcaggt 1 1 1 ggaggagagagt 1 1 gaagc t gcaaagt gggt 1 1 cat gaaat c 

aaaggtttagaggatgcgattgatttgttcagtgacatgcttcgatctcgtcctttacct 

tctgtggttgatttctgtaaattgatgggtgtggtggtgagaatggaacgcccggatctt 

gtgatttctctctatcagaagatggaaaggaaacagattcgatgtgatatatacagcttc 

aatattctgataaaatgtttctgcagctgctctaagctcccctttgctttgtctacattt 

ggtaagatcaccaagcttggactccaccctgatgttgttaccttcaccaccctgctccat 

ggattatgtgtggaagatagggtttctgaagccttggatttttttcatcaaatgtttgaa 

acgacatgtaggcccaatgtcgtaaccttcaccactttgatgaacggtctttgccgcgag 

ggtagaattgtcgaagccgtagctctgcttgatcggatgatggaagatggtctccagcct 

acccagattacttatggaacaatcgtagatgggatgtgtaagaagggagatactgtgtct 

gcactgaatctgctgaggaagatggaggaggtgagccacatcatacccaatgttgtaatc 

tatagtgcaatcattgatagcctttgtaaagacggacgtcatagcgatgcacaaaatctt 

ttcactgaaatgcaagagaaaggaatctttcccgatttatttacctacaacagtatgata 

gttggtttttgtagctctggtagatggagcgacgcggagcagttgttgcaagaaatgtta 

gaaaggaagatcagccctgatgttgtaacttataatgctttgatcaatgcatttgtcaag 

gaaggcaagttctttgaggctgaagaattatacgatgagatgcttccaaggggtataatc 

cctaatacaatcacatatagttcaatgatcgatggattttgcaaacagaatcgtcttgat 

gctgctgagcacatgttttatttgatggctaccaagggctgctctcccaacctaatcact 

ttcaatactctcatagacggatattgtggggctaagaggatagatgatggaatggaactt 

ctccatgagatgactgaaacaggattagttgctgacacaactacttacaacactcttatt 

cacgggttctatctggtgggcgatcttaatgctgctctagaccttttacaagagatgatc 

tctagtggtttgtgccctgatatcgttacttgtgacactttgctggatggtctctgcgat 

aatgggaaactaaaagatgcattggaaatgtttaaggttatgcagaagagtaagaaggat 

cttgatgctagtcaccccttcaatggtgtggaacctgatgttcaaacttacaatatattg 

atcagcggcttgatcaatgaagggaagtttttagaggccgaggaattatacgaggagatg 

ccccacaggggtatagtcccagatactatcacctatagctcaatgatcgatggattatgc 

aagcagagccgcctagatgaggctacacaaatgtttgattcgatgggtagcaagagcttc. 

tctccaaacgtagtgacctttactacactcattaatggctactgtaaggcaggaagggtt 

gatgatgggctggagcttttctgcgagatgggtcgaagagggatagttgctaacgcaatt 

acttacatcactttgatttgtggttttcgtaaagtgggtaatattaatggggctctagac 

attttccaggagatgatttcaagtggtgtgtatcctgataccattaccatccgcaatatg 

ctgactggtttatggagtaaagaggaactaaaaagggcagtggcaatgcttgagaaactg 

cagatgagtatggtatattattggtctgaactaaagaggcacaccttccagaagatttca 

ggtgttaaaagatgtttaggtgtctgcccgttctgtagctgtcaccatggttatcgtcaa 

get cggt ct t catga 

(SEQ ID NO: 52) 

gene complement (join (1752 61. .175266,176231. .176253, 

176380. .177545,177619. . 178394, 180171 .. 180210) ) 
/gene= 11 Pept ide_2 7 " 

/note="GenScan; Pl«Prom, P2=0.639, P3=0.247, P4=0.863, 
P5=PlyA« 

peptide_27 | 654_aa 

MLARVYRSGS S S S PAVSAARLFCTRS I RHALAKKSRDGESGFGGESLKLRSGFHEI KGLE 
DAI DLFGDIWRSRPLPSVIDFCKLMGVVVRMGRLD WI SLHRKMEMRRVPCNAYSFT 
KCFCSCSKLPFALSTFGKITKLGFHPTVVTFSTLI^GLCVEDRISEALDLFHQMCKPNVV 
TFTTLMNGLCREGRWEAVALLDRMLEDGLQPNQITYGTIV^ 

EEVSHIKPNWIWPLERRTCMINGFCSSGRWSEAQQLLQEMLERKKISPDVVTYNALINA 
FVKEGKFFEAEELYDE^PRGIIPSTITYSSMIDGFCKQNRLDAAEHMFYLMATKGCSPD 
I ITFNTLI AGYCRAKRVDDGI KLLHEMTEAGLVANTITYTTLIHGFCQVGDLNAAQDLLQ 
EMVSSGVCPNVVTCNTLLDGLCDNGKLKDALEMFKAMQKSKMDIDASHP 
NI LI SGLINEGKFLEAEEL YEEMPHRGI VPDTITYNSVI HGLCKQSRLDEATQMFDSMGS 
KSFSPNVVTFTTLINGYCKAGRVDDGLELFCEMGRRGIVANAITYITLIHGFRKVGNING 
AIiDIFQEJMASGVYPDTITIRNMLTGLWSKEELKRAVAMLEDLQMSVGYQLEDE 
(SEQ ID NO:53) 
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CDS CDS_27 1 1965_bp 

atgttggctagggtttatagatccggatcttcttcttctcctgctgtgtctgcggctaga 

ttgttctgtacgagatcgattcgtcatgctctggccaagaaaagcagggatggagagagt 

ggttttggaggagagagtttgaagctgcgaagcggatttcacgaaatcaaagggttagaa 

gatgcgattgatttgttcggtgatatggtacgatctcgtcctttaccttcagtaattgat 

ttctgtaaattgatgggagttgtggtgaggatgggaaggctcgatgttgtgatttctctc 

cataggaagatggaaatgaggcgggttccatgtaacgcatacagcttcaccatcctgatg 

aagtgtttctgcagctgctctaagctgccgtttgctttgtctacatttggtaagatcacc 

aagcttggttttcatcccactgttgttaccttcagcaccctgctccacggattatgtgtg 

gaagacaggatctctgaagccttggatttgtttcatcaaatgtgtaaaccaaatgtcgta 

accttcaccacgctgatgaacggtctttgccgtgagggtcgagttgtcgaagctgtagct 

ctgcttgatcggatgctagaagatggtctccagcctaaccagattacttatggaacaatc 

gtggatgggatgtgtaagatgggagacactgtgtctgcattgaatcttctgaggaagatg 

gaggaggtgagccacatcaaacccaatgtggtaatctggcctttggaaagacggacctgt 

atgattaatggattttgtagctctggtagatggagtgaagcccagcagttgttgcaagaa 

atgttagaaaggaagaagatcagccctgatgttgtaacttataatgctttgatcaatgca 

tttgtcaaggaaggcaaattctttgaggctgaagaattatacgatgagatgcttccaagg 

ggtataatccctagtacaatcacatatagttcaatgatcgatggattttgcaaacagaat 

cgtcttgatgctgctgagcacatgttttatttgatggctaccaagggctgctctccggac 

ataatcactttcaatactctcatagccggatactgtagagctaagagggtagatgatgga 

ataaaacttctccatgagatgactgaagcaggattagttgctaacacaattacttacacc 

actcttattcacgggttctgtcaggtgggcgatcttaatgctgctcaagaccttctacag 

gagatggtctctagtggtgtgtgccctaatgtcgttacttgtaacactttgctggacggt 

ctctgcgataatgggaaactaaaagatgcattggaaatgtttaaggctatgcagaagagt 

aagatggatattgatgctagtcacccctttaatggtgtggaacctgatgttcaaacttac 

aatatattgatcagtggcttgatcaatgaagggaagtttttagaggccgaggaattatac 

gaggagatgccacacagaggtatagtcccagatactatcacctataactcagtgatccat 

ggtttatgcaagcaaagccgcctagatgaggctacacaaatgtttgattcgatgggtagc 

aagagcttctctccaaacgtagtgacctttactacactcattaatggatactgtaaggca 

ggaagggttgatgatgggctggagcttttctgcgagatgggtcgaagagggatagttgct 

aacgcaattacttacatcactttgattcatggttttcgtaaagtgggtaatattaatggg 

gctctagatattttccaggagatgatggcaagtggtgtgtatcctgataccattactatc 

cgcaatatgctgactggtttatggagtaaagaggaactaaaaagggcagtggcaatgctt 

gaggatctgcagatgagtgtgggatatcagttggaggatgaataa 

(SEQ ID NO: 54) 

gene complement (join (180597. .180602,180714. .181634, 

181896. .182003,182169. .182208) ) 
/ gene = " Pep t i de_2 8 11 

/note= n GenScan; Pl=Prom, P2=0.998, P3=0.959, P4=PlyA" 
pep t ide_2 8 1 3 4 2_aa 

MRGGASGNNVLETINAAATAFASSDDRVHHQPSPIHRRKRIGKAALAPEPVPTDSTSNSG 
YRSVMTALPFIAPPSSPASFFQSEPPSATQSPVGILSPSPLPSNSHNNNNNSEERPSIFA 
IGPYAHEPQLVSPPVFSTYTTEPSSAPVTPPLDESFYLTTTTPSSPEVPFAQLFNSSSNY 
GVRSPVSNYEFQFYQLPPGSPIiAQLISPSSVMSGSGATSPFPDGIiAQFQVSDPPKIiLSPG 
KLRCSKSVTTPKEQNKIVRPNKPVSFT)LDADHFIRC^KKIjRTT 
SNKEFDFGTTDE I HLTGDDEHRDSTKNSSDWS FPVMQSGTLS 
(SEQ ID NO: 55) 

CDS CDS_28 | 1029jbp 

atgagaggcggcgcgagtggaaacaacgttttggagactataaacgcagccgctactgcg 
ttcgcttcctctgatgatcgtgttcatcaccaaccttccccgattcatagaagaaaacga 
atcgggaaagctgctcttgctcctgaaccggttcctaccgattccacatccaattccggt 
tatcgttcggttatgacggctcttcctttcatagccccaccttcctctccagcttccttc 
ttccaatcagaacctccttccgctacacagtcacctgtagggatcctctcctttagtcct 
ctaccttctaacagccacaacaacaacaacaacagcgaagaacgtccttcgatcttcgcc 
atcggaccttacgctcacgaacctcagctggtttctcctccggttttctctacttacaca 
accgaaccgtcttcagctccggtcacgccgcctctcgacgagtctttctacttaaccacc 
accacaccgtcttcgcctgaagtccctttcgctcagctctttaactccagcagtaactac 
ggtgtcaggtctccggtgtctaactacgagtttcagttttaccaacttcctcccggtagt 



75 



WO 03/006622 



PCT/US02/22217 



ccactcgctcagcttatctcccccagctcggttatgtccggttctggtgcgacttctccg 
tttcctgacggactcgctcagtttcaagtctctgatccaccaaagctgctgagccctggt 
aaactgcgttgctccaagtctgttacaactcctaaagagcagaacaagattgtgagaccg 
aacaaaccggtttcgttcgatcttgatgcggatcatttcattagatgcgttgataagaag 
ctgagaacaacgttccctgaagcgtctgatcaagaagcagctcaacattcctcctccgga 
tccaataaagaattcgatttcggcaccaccgatgagatacatttgaccggtgatgatgag 
catagagattcgaccaagaacagcagcgattggtccttccctgtgatgcaatcaggcaca 
cttagctaa 
(SEQ ID NO: 56) 

gene complement (join (182434 . .182439,182881. .183285, 

184053 . . 184094, 186762 . . 186801) ) 
/gene= " Pept ide_29 11 

/note="GenScan; Pl=Prom, P2»0.180, P3=0.217, P4=PlyA" 
peptide_29 1 148_aa 

MKSSHRAKQVEIGFFWREITFFRFWRKITPFI^WRENAPSGPHEKLRPSGFGGKLRFSGF 
GGKI TFSGFDEKLCFSGLDGKLRFS ILVGNYVCRFWREI VFFGFDGKLRFLGFGVKVCFL 
VLARNWVFRFLVGNCLFPILAGYCIFRF 
(SEQ ID NO: 57) 

CDS CDS_29|447_bp 

atgaagtcaagccaccgtgcaaaacaagttgaaattggctttttttggcgagaaattacg 
tttttccggttttggcggaaaattacgtttttccggttttggcgggaaaatgcgttttcc 
ggttttcacgagaaattacgtttttccggttttggcggaaaattacgtttttccggtttt 
ggcgggaaaattacgttttccggttttgacgagaaattgtgtttttccggtttggacggg 
aaattgcgtttttcgattttggtgggaaattatgtttgtcggttttggcgggaaattgtg 
tttttcggttttgacggaaaattgcgttttctcggttttggcgtgaaagtgtgttttttg 
gtcttggcgagaaattgggttttccgatttttggtgggaaattgcctttttccgattttg 
gcgggatattgcatttttcgattttga 
(SEQ ID NO: 58) 

gene join(187074. .187113,187493. .187526,188534. .188733, 

189397. .189500,189760. .190260,190331. .191063, 
191102. .191473,191583. .191882,192117. .193246, 
193319. .193375,194106. .194333,194405. .195500, 
195582. .195881,196902. .198149,198436. .198441) 
/gene= 11 Pept ide_ 3 0 11 

/note="GenScan; Pl=*Prom, P2»0.436, P3=0.402, P4=0.107, 
P5=0.460, P6=0.572, P7=0.532, P8=0.954, P9=0.471, 
P10=0.400, Pll=0.949, P12«0.995, P13=0.193, P14=0.202, 
P15=PlyA" 

peptide_30 1 2100_aa 

MIRTEPEFQREVFKEEDLQHEQYRQICFLFSTTSQEVEIIECGVQIVRDRSGRKRSDADA 
VNINHNNNCWKTTMKVSTKEMKFPSKEVS PEQEESRTSQKRQQQGS I KS S S FRDFHRYSA 
LMAS SSS SSS PRTWRYRWTSFHGPDVIUCrFLTHIiRKQFNCNGI SMFDDQGI ERGHTI AP 
ALTQAIRESRISIVVLTKHYASSRWCLDELLGILKCKEEIGQIVMTIFYGVDPSDVRKQT 
GDFGK\fl?KDTCRRKTEEERRRWSQALTDVGNIAGEH^ 

TI SRDFEDMVGI EAHLDKMQSLLHLDDEDGAMFAGI CGPAGI GKTTIARALHSRLS S SFH 
LTCFMENLRGSCNSGLDE YGLKLRLQELLLSKI FNQNDMRIYHLGAIPQRMCDQKVLI IL 
DDVDDIiQQLEAIiADETNWFGDGSRIVVTTEDQELLEQHGINNTYYVDIjPTDD 
YAFRRSLTPYGFETLVERTTELCGKLPFGLRVQFYAERKKTTGKIDAVLRVGYDSLHENE 
QTLFLLIAI FFNYQDDGHVKTMLADTNLDVRLGLKTLAYKSLTKI SSQGKIVMHKLLQQV 
GRQAVQRQEPWKRRILIDPQEIOTVLEPWIOIQVLTDTDEI RDVLENDSGSRNLMGVS FDM 
STILHDMDISARAFTSMRNLRFLKVYKTRCDTNWVHLPEDMEFPPRLRLI^ 
LPRTFCTEHLVELYLRDTELEQLWEGTQPLTNLK^ 

LDRCRSLVEIHSSVGNLHKLESLEVAFCYNLQVVPNLFNLASLESFMWGCY 
STTITELSIPDTLLEEFTEPIRLWSHLQRLDIYGCGENLEQVRSDIAVERIPDCIKDLQR 
LEELTIFCCPKLVSLPELPRSLTLLIVYECDSLETLAPFPLGSEIEALSFPECFRLDREA 
RRVI TQLQS S WVCLPGRNI PAEFHHRVI GNFLAI CSNAYRFKLCAWS PKQVMVEDED I E 

LLOIILINGCPMKSPIKSIYNLRIRIQSEHLFIFPSTMLKEDRQIjGQYSEILFKFSTTSQ 
NTEI I KCGVQILRDRRS CDSKS EQDDDE SLYS SLDYDAPRVDTINLLKEHKDLI SDYFTS 
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FFSLWCIiDELLGILKCKEEMGQIVMTIPYGVDPSDVRKQTGDFGKVFKETCRRKTEEERR 
RWSQALTDVGNIAGEHFLNWDKESEMIEKIARDVSN^^ 

SLLHLDDEDGAMFVGI CGPAGI GK3TI ARALHSRLSSTFQHTCFMENLRGSCNSGTDEYG 

LKLRLQELLLSKI FNQNGVKLFHLGAI KERLCDLKVL I VLDDVDDLQQLEALADDTNWFG 

DGSRIIVTTEDQEILEQHGISNTYRVDFPTQVDARQIFCRFAFRQIiSAPHGFEKIjVDRVI 

KLCSlsnjPLGLRVMGSSLRRKKVDDW 

LIACFFNYKDDDHVKAMLVDSNLDV^^ 

HLQDPRKRQIIiIDSHQICDVLENDSDGTSVMGISFDTSTIPNGVYISAQGFRRMRDLRFL 
SIYETRRDPNVRVHLPEDMSFPPLLRLLHWEVYPGKCL^^ 

WQGVQPLTNLKKMDLSGSLSLKEVPDLSNATSLKRLNLTGCWSLVE I PS S IGDLHKLEEL 
EMNLCVSVQWPTLLNLASLESLRMVGCWQLSKIPDLPTNIKSLWGETMLQEFPE 
WSHLHSLNI YGSVLTVPLLETTSQEFSLAAATIERI PDWI KDFNGLRFLYIAGCTKLGSL 
PELPPSLRKLIVDNCESLETVCFPCDTPTTDYLYFPNCFMLCQEAKRVITQQSLRAYFPG 
KEMPAAEFDDHRSFGSSLTIIRPAICKFRICLVLSPAPDMEEAYFKLLFRIRAKGCPSDD 
DMLSLDLAKIQGEHLFIFHIEFVEHHEEMVFKFSTSSHEVDVIECGVQVLTDETSRRSNE 
SCSEQVSEDGDDILSDDDKSNEIYEPRVKIFTGYTMFLSLVFTFLLSLISSLILYRFLKN 
(SEQ ID N0:59) 

CDS_30|6303_bp 

atgataagaacagaacctgaatttcaaagagaagtgtttaaggaagaagacctacagcat 

gaacaatatcgccagatatgtttcctgttcagcaccacatcccaggaagtcgaaattatt 

gaatgtggcgtccagatcgtgagggacagaagtggcagaaaaagaagcgacgccgatgct 

gtgaatataaatcacaacaacaactgttggaagacgacgatgaaagtctctacgaaagag 

atgaagtttccttcaaaagaagtctcgcctgaacaagaagaaagcagaacaagccaaaag 

cgacaacaacaaggatccataaaaagttccagtttcagagactttcatcgatactctgct 

ctcatggcttcttcttcctcttcttcctcgcctcgcacatggagataccgcgtcttcacg 

agcttccacggacctgacgtccgcaaaaccttcctcactcacctacgcaagcagtttaac 

tgcaacgggatatcgatgttcgacgatcaagggatcgagagaggccacaccatcgcccct 

gctctcacacaagcgatcagagaatcgaggatctctatcgtggtgttaacgaagcactat 

gcttcttccaggtggtgtttggatgagcttttggggattctgaaatgcaaggaagagatc 

gggcagatagtgatgaccatcttctacggagtagatccttccgatgttcggaaacaaacc 

ggagatttcgggaaagtcttcaaggacacatgccgtcgtaaaacagaggaagagaggcga 

agatggagccaagctttgaccgatgtgggaaacatagctggggaacactttctcaactgg 

gacaaggaatcggagatgattgaaaagattgcgagagatgtctcaaacaagcttaatgct 

acaatctctagggattttgaagacatggttggtattgaagcacacttggataagatgcag 

tctttgttacatttagatgatgaggatggagctatgtttgctggaatctgtggccctgct 

ggcattggtaagactaccattgctagggctctacatagtcgactctctagcagttttcat 

cttacttgttttatggagaatcttcgaggaagctgtaatagtggtctcgacgagtatgga 

ttgaaactgcgtttacaagagctacttctttcaaagatttttaaccagaatgatatgagg 

atataccatttaggtgcgataccgcaaagaatgtgtgaccaaaaagttcttatcattctt 

gatgatgtggacgatctgcagcagcttgaggctctggctgatgaaactaactggtttggt 

gatggaagcaggattgtggtgaccacggaagatcaagagcttttggagcaacatggtatc 

aacaatacatactatgtggatcttccgactgacgacgaggctcgtaagattttttgtaga 

tatgctttcagacggagcttaacaccatatggttttgaaactcttgtcgaaagaacaaca 

gagctttgtggcaaacttccttttggtctccgtgttcaattttacgcggaaagaaagaag 

acgactgggaaaatcgatgcagtacttagagttggatacgacagtttacatgagaacgaa 

caaactctgtttctcctcattgccatcttcttcaactaccaagacgatggtcacgtgaaa 

acaatgctcgctgacactaacttggatgtcagactcggcttgaaaactctcgcttataag 

tctctcacaaaaatatctagccaaggaaaaatagtgatgcacaagttactacaacaagtg 

ggcagacaagcagttcaaagacaagagccttggaaacgtcggatcttaattgatcctcaa 

gagatctgcgatgttcttgagccttggaaacgccaagtcctaactgataccgatgagatt 

cgcgatgtccttgaaaatgattctggtagtagaaatttgatgggggtatcttttgatatg 

tctacaatcttacacgacatggatattagcgcaagagcttttacaagtatgcgtaatctt 

cgatttctcaaggtctacaaaacaagatgtgatacaaatgttagagtgcatttacccgag 

gacatggagtttccacctcgtctgaggttattacactgggaggtatacccgagaaagttt 

cttcctcgtacattttgtactgaacatcttgtggaactctatttaagagataccgagctc 

gagcaattatgggagggaacccagcccctcacaaatctcaagaagatgtttttgggttcg 

tgcctgtatcttaaggaacttccggatcttgcaaaagctacaaacctagagaaattgagg 

ttggatcggtgcaggagtttggtagagattcattcctctgttggaaaccttcataaacta 

gagagtttggaagtggctttctgttataatctacaggttgttccgaatcttttcaacttg 

gcatctcttgaatcattcatgatggtgggatgctaccaactgaggagtcttccagatatt 
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tctacgaccatcacagaactctcaatcccagacacactgttagaagagtttactgaacca 

attaggctctggtctcaccttcagagactcgatatatatggctgtggggaaaatttggag 

caagtgcgaagtgacatagctgttgagagaattccagactgcatcaaagatctccaacgg 

ttagaagaactaactatattttgctgtccaaaacttgtatcactgccagagctccctagg 

tcgctcacattactaatagtatacgaatgtgattcactggagacactagcacctttccct 

ttaggttctgagattgaagctctctcttttcccgaatgcttcagattggatcgagaagca 

aggagagtaattacccagctgcaatcatcatgggtatgcctacctggaagaaatatacct 

gcggagttccatcaccgggttataggaaatttcttggccatatgctcaaatgcataccga 

tttaagctttgtgccgtggtttcccctaaacaggtgatggtggaagatgaagatatagaa 

ttactgtgtcacatactcataaatggttgccccatgaagagccccattaagagcatatat 

aatcttaggattagaatccaatcagaacatctgtttatatttccctccacaatgctcaag 

gaagaccgacagcttggacaatacagtgagatattgttcaaatttagcaccacatctcag 

aacactgaaattattaaatgtggtgtccaaatcttgagggacagaagaagctgtgattct 

aagtcagaacaagacgacgatgaaagtctctacagcagccttgactatgatgcaccaaga 

gtagataccattaacttactaaaggaacataaagacttgataagtgattatttcacatca 

ttcttttccttgtggtgtttggatgagcttttgggaattctgaaatgcaaggaagagatg 

gggcagatagtgatgaccatcttctatggagtagatccttctgatgttcggaaacaaacc 

ggagatttcgggaaagtcttcaaggaaacatgccgtcgcaaaacagaggaagagaggcga 

agatggagccaagctttgaccgatgtgggaaacattgccggggaacactttctcaactgg 

gacaaggaatcggagatgattgaaaagattgctagagatgtctcaaacaaacttaatgct 

accatctctagggactttgaagacatggttggtattgaagcacacttggatgagatgaac 

tctttgttacatttagatgatgaggatggagctatgtttgttggaatctgtggccctgca 

ggcattggcaagactaccattgctagggctttacatagccgactctccagcactttccag 

catacctgttttatggagaaccttagaggaagctgtaacagtggtactgacgagtatgga 

ttgaagttgcgtttacaagagctacttctttccaagatttttaaccaaaatggtgtaaaa 

ctatttcatttaggtgccataaaggaaaggttatgcgacctaaaagttcttatcgttctt 

gatgatgtggacgatctgcagcagcttgaggctttggctgacgatactaactggtttggt 

gatggaagcaggattatcgtaaccacggaagatcaagagattttggagcaacatggtatc 

agcaatacataccgtgtggatttcccaactcaagtagatgctcgtcagatcttttgtaga 

tttgcttttagacagctctctgcaccccatggttttgaaaaacttgttgacagagtaata 

aagctttgcagcaaccttcctttgggtctccgtgtcatgggctcgtctttacgcagaaag 

aaagtagacgactgggaaggtatactgcagagactagagaatagctttgatcaaaagatt 

gatgcagtacttagagtcggatataacagtttgcataaggatgaccagttcctgtttctc 

ctcattgcatgcttcttcaactacaaagacgatgatcacgtgaaagcaatgctcgttgat 

agtaacttggatgttagactcggcttgaaaaatctcgtgtataaatctctcatacagata 

tccgcagaaggaaccatagtaatgcacaagttattacaacaagtgggtagagaagcggtt 

catttgcaagatcctaggaaacgccaaatcctaatagattctcaccagatttgtgatgtc 

ctcgaaaatgattctgatggtacaagtgtgatgggtatatcctttgatacatccacaatc 

ccaaacggagtgtatataagcgcgcaagggtttagacgaatgcgtgatcttcggtttctc 

agcatctacgagacaagacgtgatcctaatgttagagtgcatttacctgaggacatgagt 

tttccacctcttctaaggttattacactgggaggtatatccaggaaagtgtcttcctcat 

actcttaggcccgaacatcttgtggaactctgttttgttaacagcatgctcgagcagctt 

tggcaaggagtccagcctcttacaaatctcaagaagatggacttgtccgggtcactgagt 

ttgaaggaagtccctgatctttcaaatgctacaagtctcaagagattaaatctaacaggt 

tgctggagtttggtagagattccttcctctattggagaccttcataaactggaggagctg 

gagatgaatttatgtgtaagtgtacaggtttttccgactctactcaacttggcatctctt 

gaatcactcaggatggtaggatgctggcaactaagcaaaattcctgatcttccaaccaac 

atcaaatcacttgtagtcggggaaacaatgctacaagagtttcctgaatcagtcaggctt 

tggtctcaccttcatagtctcaacatatatggcagtgtcctcacagtgccacttttggaa 

accacgtcgcaagaattctccctcgctgctgctacgattgagaggattccagattggatc 

aaagattttaatgggctaaggtttctttacatagctggctgcacgaaacttggatcactg 

ccagagctccctccctcgcttagaaaactaatagtagacaactgtgagtcactagagacg 

gtctgttttccttgcgacactccaacaactgattatctctacttccccaactgcttcatg 

ttgtgccaagaagcaaagagagtaatcacacagcaatcattgagagcttacttcccagga 

aaagagatgcctgctgcagagttcgatgatcatcgatcatttggaagttccttgaccatc 

atccgtccggctatctgcaagtttaggatttgcctggtgctttctcctgcaccggatatg 

gaagaagcttatttcaaattactgtttcgcatacgtgcaaaaggttgtcccagtgatgat 

gacatgctttcgttagatctcgctaaaatccaaggggagcatctttttatatttcacatt 

gagtttgttgaacatcacgaggagatggtgttcaaattcagcacctcatcccacgaagtc 

gacgttattgaatgcggtgtacaggtcttgacagatgaaaccagcagaagaagcaatgaa 

tcttgttcagaacaagtgtctgaagacggggatgatattctatcggatgatgataagagc 
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aatgagatttatgaacccagagtaaagatatttacgggttatacaatgtttctttcttta 
gtttttacgttccttttgagtttgatttcaagtctaattctgtatagattcctaaagaac 
taa 

(SEQ ID NO: 60) 

gene complement (join (198625 . .198630,198733. .198813, 

198944. .199015,199100. . 199216, 199253 . . 199414, 
199503 . .199856,200002. .200148,200252. .200390, 
200469. .200617,200760. .200928,201066. .201146, 
201231. .201340,201566. .201577,201943 . .202050, 
202075. .202155,202241. .202304,202431. .202456, 
202555. .202641,202727. .202754,202867. .202928, 
203018. .203230,203316. .203423,203511. .203585, 
203942 . .204018,204139. .204214,204308. .204370, 
205471. .205510)) 
/gene= " Pept ide_3 1 " 

/note= n GenScan; Pl=Prom, P2«0.989, P3=0.989, P4=0.975, 
P5=0.879, P6=0.985, P7=0.993, P8=0.437, P9=0.433, 
P10=0.990, Pll=0.922, P12=0.859, P13=0.999, P14=0.42i, 
P15=0.824, P16=0.996, P17=0.683, P18=0.979, P19-0.983, 
P20=0.999, P21=0.994, P22=0.934, P23«0.729, P24=0.737, 
P25=0.990, P26=0.966, P27=PlyA« 
peptide_31 1 886_aa 

MDAPKE I FLKDYTKPD YYFETVDLS FSLGEEKTI VS SKI KVS PRVKGSSAPLVLNGHDLK 
LLSVKVEGKLLKGLYKSSGNFCTOCEAEGFRKITFYQDRPDIMAKYTCRVEADK^ 
LSNGNLISQGDVEGGRHFALWEDPFKKPCYIjFALVAGQLASRDDTFTTRSGREVSLKIWT 
PAEDLPKTAHAMYSLKAAMKWDEDVFGLEYDLDLFNIVAVPDFN^ 

VLASPETATDADyAAILGVIGHEYFHNWTGNRVTCRDWFQLSLKEGLTVFRDQEFSSDMG 

SRTVKRIADVSKLRIYQFPQVTTWVYLVFSKPIIiCCLIIQDAGPMAHPVRPHSYIK^ 

GI DL YFQRHDEQAVTCEDF FAAMRDANNAD FANFLQ WYSQAGTP WKVAS S YNAEARTF S 

LKFSQEIPPTPGQPTKEPTFIPVWGLLDSSGKDITLSSVYHNGTLQTISSSSTILRVTK 

KEEEFVFSDISEKPVPSLFRGFSAPVRVETDLSDDDLFFLLAHDSDEFNRWEAGQVIjARK 

LMLNLVSDFQQNKPLVLNPKFIQG^ 

D AVHAVRKFVRKQLAS ELKTELLKI VENNRS TEAYVFDHPNMARRALKNT ALGWHLR VI I 

KLNESLFWKSLTSLCVCFVEIMPAYLASLEDPAYVELALGEYKSATNLTO 

QKPGQTRDEVLADFYNKWQGDYLVVNKWFLI^ 

ASEFSKLLFVPQLSFSDVYSLIGGFCGSPVNFHAKDGSGYKFLGDIWQLDKINPQVASR 
MVSAFSRWKRYDETRQALAKAQLEMIMSANGLS ENVFE I ASKSLAA 
(SEQ ID NO: 61) 

CDS CDS_3l|2661_bp 

atggatgcacctaaggaaatctttctcaaggactacaccaagcctgattactactttgaa 
actgtggatctaagcttctctctaggtgaagagaagacaattgttagctccaagatcaaa 
gtttcccctcgagttaaaggatcttctgctcccttggtcttgaatgggcatgacttgaag 
ctactctctgtcaaggttgaggggaagcttctaaaggggctctacaagtcatctgggaat 
ttttgcacacaatgtgaagcagagggtttccggaaaatcacattttaccaggaccgtcct 
gatattatggcgaagtacacatgccgtgttgaagctgacaagtcactctatcctgtactg 
ttgtccaacggaaacctcatttctcaaggagatgtagagggaggtcggcactttgcctta 
'tgggaggatcctttcaagaagccatgctatctatttgctctggtggctggacagctagcg 
agcagagatgatacatttactacacgctctggtagggaggtatctctgaaaatctggact 
cctgcagaagatctaccaaagactgctcatgccatgtattctctgaaggcggccatgaag 
tgggatgaagatgtgtttggccttgagtatgacctggatctcttcaacattgtcgccgtt 
ccagattttaacatgggagccatggaaaacaagagtttgaacatttttaattccaagctt 
gtcctggcatctccagaaactgcaacagatgcagattatgctgcaattttgggagttatt 
ggtcatgaatacttccacaattggacaggcaacagggtgacatgccgtgactggttccaa 
ctcagtctaaaggaaggtctaactgtcttccgtgatcaggagttttcatctgatatggga 
agccgtactgtaaagcgtattgctgatgtttcaaagctcaggatctatcaattcccgcag 
gtcactacatgggtttatctcgttttttctaagccaatcctctgttgtttaattatccag 
gatgctggtcctatggcacatcctgttcgcccacattcatacatcaaggtttatgaaaag 
ggtattgatctttattttcaaagacatgatgagcaagctgtgacctgtgaagacttcttt 
gctgctatgcgtgatgcaaacaatgcagattttgctaatttcttgcaatggtactctcaa 
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gctggaacgccagtcgtcaaagtggcatcctcttacaatgctgaagctcgtaccttctct 

ttgaaattcagtcaggagatacccccgactccgggccagccaacaaaagaaccaacattt 

attccagtggttgttggtcttctggactcaagtggaaaagacattacactttcctctgtt 

taccataatggtacacttcagaccatttcaagcagcagcacaatccttcgagtgaccaag 

aaagaagaagagtttgtgttctctgatatatcagaaaaacctgtgccgtctctgtttaga 

ggattcagtgccccagttcgtgttgagactgatctctctgatgatgatctgttcttcctc 

ttagcccatgattcagatgaattcaatcggtgggaggcaggtcaagttctggcaaggaag 

ctcatgctgaacttagtttctgatttccaacaaaacaaaccattggttctaaacccaaag 

ttcattcagggtctcggcagcgtgctttctgactcgagcttggacaaggaatttatagcc 

aaggcaataacattacctggggagggagagattatggacatgatggccgtggcggatcct 

gatgctgttcatgccgttagaaagtttgttagaaagcagcttgcatctgaacttaaaact 

gagcttctaaagatagttgagaacaataggagcactgaggcttatgtctttgaccacccc 

aatatggctaggcgtgctttgaagaacactgctctaggttggcatttgcgagtcattatt 

aagttaaacgagtctctctttgtgcggaagtcactaacttctttgtgtgtttgctttgtt 

gaaataatgccagcttatcttgcatcgcttgaggatccagcatacgtggaacttgcattg 

ggtgaatacaagtcggccaccaatttgacagaccaaahtgctgctttggcagctcttgca 

caaaagccgggtcaaacccgtgatgaggttcttgctgatttctataacaagtggcagggc 

gattacttggttgttaataaatggttcctccttcaagcatcatccgacattcccgggaat 

gtggagaatgtcaagaagcttttggatcacccagcttttgatctgcgcaatccaaacaag 

gcaagtgaattctcaaaactcttattcgttccgcaactttctttctcagatgtttactcg 

ctcattggagggttctgcggttcgccagtgaatttccatgcaaaagatggatcaggttac 

aagttcttgggtgacattgttgtccagttagacaaaatcaatcctcaggttgcttctcgt 

atggtatctgcgttttcgaggtggaagcgctacgatgaaacccgacaagctctggccaag 

gcacagttggagatgataatgtcagctaatgggttatctgaaaatgtatttgagattgcc 

tctaagagcttggctgcttga 

(SEQ ID NO: 62) 

gene complement (join (205579 . .205584,206216.-206365, 

206457. .206579,206721. .206831,207146. .207282, 
207361. .207513,207594. .207668,207765. .207842, 
207943. .208007,208105. .208216,208295. .208405, 
208557. .208655,208726. .208765) ) 
/gene= "Pept ide_32 " 

/note="GenScan; Pl«Prom, P2=0.971, P3=0.989, P4=0.872, 
P5=0.978, P6=0.976, P7«0.972, P8«0.999, P9=0.936, 
P10=0.941, Pll=0.981, P12=0.999, P13«PlyA w 
pept ide_3 2 1 4 0 3_aa 
MRGLVKXLVSRSLSVSGKWQHQQLRRLNIHE^ 

I QQVFPNETELVVKSQILAGGRGLGTPKS6LQGGVH I VKRDQVQDI AGKMLGQVLVTKQT 
GPQGKWSKVYLCEKLSLVNEMYFSI ILDRKSAGPLI IACKKGGTSIEDLAEKFPDMI IK 
VPIDVFAGITDEDAAKVVDGLAPKAADRKDSIEQVKKLYELFRKSDCTMLEINPIjAETST 
NQLVAADAKLNFDDNAAFRQKE I FALRDPTQEDPREVAAAKVDLNYI GLDGEI GCMVNGA 
Gl^QPWTSLNCTVVE^KILTSDDKVKAIL^ 
VVVRLEGTNVEQGKRILKESGMKLITADDLDDAAEKAVKALAN 
(SEQ ID NO: 63) 

CDS CDS_32 | 1212_bp 

atgaggggattggtgaagaagctcgtgtccagatctctctccgtctccggcaaatggcag 
caccaacagcttcgccgtctcaacatccacgagtatcagggagcagagctgatgggtaaa 
tacggagtaaacgtgcccaacggagtcgctgtctcttccctcgatgaagtcaaaaacgct 
atccaacaagttttccctaatgaaaccgagttggtcgttaagagccagatcttggctggt 
ggaagaggtctcgggactttcaagagtggtcttcaaggtggtgttcacattgtcaaacgt 
gatcaggttcaagatattgctggcaagatgcttgggcaagttctcgtcaccaaacaaact 
ggtcctcaaggcaaagtagtcagcaaggtctacttgtgtgagaagttgtcactcgtcaat 
gagatgtacttttccattattctcgaccgtaaatctgctggaccgcttataattgcctgt 
aaaaagggtggtaccagcattgaagatcttgctgagaagttccctgacatgattattaag 
gtaccgatcgatgtatttgcaggtattacagatgaggatgctgccaaggttgtggatggt 
ctggctccaaaagctgctgacagaaaagattcgattgaacaagtgaagaagctttacgaa 
ctcttccgcaagagtgactgcactatgttggaaatcaaccctctcgctgagacatccacg 
aaccaattggtagctgctgatgctaagttgaactttgatgataatgctgctttccgtcag 
aaagagatttttgcccttcgtgatccaacgcaggaggatccacgagaggtggctgctgca 
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aaagtggacctgaactatatcggtttagatggagagattggctgcatggtgaatggtgct 

ggattggccatgcaaccatggacatcattaaactgcacggtggtggaggcgtttaagata 

ctgacatcggacgataaagtgaaagcaatattggtgaacatattcggtgggataatgaaa 

tgtgatgtgattgctagtggaattgtgaatgctgctaaagaggtgtcactgaaagtacca 

gtggtggttcgtctggaaggaacaaatgttgaacaaggaaagagaatcctcaaggaaagt 

ggaatgaaactcataacagctgatgatttggatgacgcagcagagaaagctgtgaaggca 

ttagctaattag 

(SEQ ID NO: 64) 

gene complement (join (208954. .208959,209570. .209769, 

210814. .210874,211045. .211084) ) 
/gene= " Pept ide_3 3 11 

/note= n GenScan; Pl=Prom, P2=0.592, P3=0.647, P4=PlyA n 
peptide_33 1 86_aa 

MEVFRNHKKIiI FTKPESFEKI APKLKFLGLLAAR IETECEGASEI GLTRGLRRWIRF VES 
CDLIYYASSGSLRFERTNCLKSSDEF 
(SEQ ID NO: 65) 

CDS CDS_33|261_bp 

atggaagttttcagaaaccataaaaagttaattttcacaaaaccggaaagttttgaaaaa 
attgcgccaaaattaaagtttttagggttactggcggcgcgtattgaaacagagtgtgag 
ggagcgtccgagatcggattgacacgaggtttacggcgttggattcgtttcgtcgagagc 
tgcgatttgatatattacgcgtcgtctggatccttacggtttgagagaacaaactgtttg 
aagtcgtcggatgagttttag 
(SEQ ID NO: 66) 

gene join(211441. .211480,211596. .211931,213462 . .213530, 

213537. .213542) 
/gene= " Pept ide_3 4 n 

/note="GenScan; Pl=Prom, P2=0.243, P3=0.206, P4=PlyA n 
pep t ide_3 4 [ 1 3 4_aa 

MNTAIVLKIRGLDRCI IKGEYAAENEKYGAITI IRQHLTEDLRDQYLNIANPLDLRTELK 
SRYTIVSLPKFITQMGLLFEWINLRFRDFRSVDEYNSALIKIVSKLKLCGEEHCLRQNER 
SHETSSRESDQKNV 
(SEQ ID NO: 67) 

CDS CDS_34|405_bp 

atgaacactgcaattgtcctgaagataagaggacttgacagatgtatcatcaaaggcgag 
tatgcagctgaaaatgaaaaatatggggcaataacaattattcgccaacatctcactgag 
gatctcagagatcagtatctaaatattgcgaaccctctagaccttcggacagagttaaaa 
tccagatacacaatagtgtcattaccaaaatttataactcaaatgggtcttttatttgag 
tggataaatctcagatttcgggactttaggtccgtagatgaatataactcagctctaatc 
aaaa t cgt 1 1 c t aaa 1 1 gaaac t at gt ggtgaagagca 1 1 gt c t acgacaaaatgaaaga 
agtcatgagacatcatccagagaaagtgaccagaagaatgtctga 
(SEQ ID NO: 68) 

gene 35 complement (join (213617 . .213622,213659. .213834, 

214893. .214958,217045. ,217060,217158. .217197) ) 
/gene= "Pept ide_35 " 

/note="GenScan; Pl=Prom, P2=0.242, P3=0.100, P4=0.191, 
P5=PlyA" 

peptide_35 1 85_aa 

MIHQPVPKLLPKKFIVSFALNRVIPLLHLOT^ 

CSFDADHSTMSSCSGRMDDAQSVLG 

(SEQ ID NO: 69) 

CDS CDS_35|258Jbp 

atgattcatcaaccagtgccgaagctcttacccaagaagttcatagtttcgttcgctctc 
aatcgagtgattccacttctgcatctttggacgatgcattcctatctggtcgatgagctt 
cttgtaatactttgtaatttcgccggctttctggaacttcatttggacatggtcaagacg 
tgctcctttgatgctgaccactcgacaatgtcttcatgttctggtcggatggatgatgct 
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caatcagtactcggatag 
(SEQ ID NO: 70) 

gene complement (join (217877 . .217882, 218055. .218153, 

218272. .218359,218463. .218569,218660.-218749, 
218827. .218970,219509. .219621,219923.-220006, 
220093. .220204,220949. .220988) ) 
/gene= » Pept ide_3 6 tt 

/note^'GenScan; Pl=Prom, P2=0.921, P3=0.998, P4=0.782, 
P5=0.999, P6=0.998, P7-0.997, P8=0.996, P9=0.829, 
P10=PlyA" 
peptide_36 1 278_aa 

MFARNIRLRKEYLYMKNLEGEERRLYEKKRKIREAI^EGKP 

DQNTSWPGSHIDDEYAKATEKDPKILMTTSRDPSAPLTRFVKVISEIIEMARSHDYSDV 
I FVTENRGRPDGL I VSHLPFGPTAYFQLLNVVTRYD I QSKKATGKMSEQYPYVT FDNFTS 
QMGKRVGSMLKHMFPVPKLDARRI VTFRNKSDYI SFRNHVYDKGEGGPKS I ELKEVGPRF 
ELRLYQVKLGTMEQEEAEMSWVLRPYMNSAKKRRFIGQ 
(SEQ ID NO: 71) 

CDS CDS_36|837_bp 

atgtttgccaggaacattaggttaagaaaggagtatctctacatgaaaaacttggaaggt 

gaagagcgtcggctctatgagaagaagcggaagataagggaagccctgcaagaagggaag 

ccgattccatctgagctccgaaacgaggaggcgaagcttcgtgaggagattgatcttgaa 

gatcaaaacacctccgtagttccagggagtcatattgatgatgaatatgcaaaagcaacg 

gaaaaagatcccaagattttgatgactacgtctagggatccaagtgctcctctaacgcga 

ttcgttaaggtcatttctgagattattgagatggcgcgttcgcatgattatagtgatgtg 

atatttgttactgagaaccgtggtaggcctgatggccttatcgtttctcatctcccattc 

ggaccaactgcttactttcaattacttaatgtggtaacaaggtatgatatccaaagcaag 

aaagcaacggggaagatgtctgagcaatatccttacgtcatttttgacaactttacaagc 

cagatgggtaaaagagttgggagcatgttaaaacacatgttcccagttccaaaactggat 

gcaagacgtatagttactttccgtaataaatctgattatatttctttcaggaatcatgtg 

tatgataaaggagaaggaggcccaaaatcgatagagttaaaagaagtcggtcctcggttt 

gagttgcggctctaccaggtgaaattaggaacaatggaacaagaggaagcagagatggaa 

tgggttcttagaccctacatgaactctgctaaaaaacgccgctttatcggccaataa 

(SEQ ID NO:72) 

gene join(221006. .221045,222322. .222833,223076. .223194, 

225250. ,225287,225478. .225483) 
/ gene« " Pept ide_3 7 !l 

/note="GenScan; Pl:=Prom, P2=0.575, P3=0.919, P4=0.363, 
P5=PlyA" 

pept ide_3 7 1 2 22_aa 

MAAITSLQAIHLKLGRRGSIRCGISEPSGEPAPVGQiCTRYNDGLAERV^ 
GGSKKKKKDETKEKEFWEYDYESFVEVSKGVMQGRSRAQQQEVVREVLLSMLPPGAPEQF 
RKLFPPTKWAAEFNAALTVPFFHWLVGPSQVI EVEVNGVKQRSGVRI KKCSTCLFTWDF 
EDMSCEMIYGQVPPTFEEDPATKQPCLADICSYTSKPNPPEP 
(SEQ ID NO: 73) 

CDS CDS_37 | 66 9_bp 

atggcggctattactagtctccaagcaatccatctcaaactcgggagacgtggcagcatc 
cgatgtgggatctcggagccgagcggagagccagctccggtagggcagaagactagatac 
aacgatggcttagctgagagagtgttcatggggctgttcgcgaggaagatggacaagttt 
ggtggctcgaagaagaagaagaaggatgagacgaaggagaaagagttttgggaatacgac 
tacgagagcttcgtcgaggtttcaaagggagtgatgcaaggacgatcaagggcacagcag 
caagaggttgtgagagaggttcttctctccatgctccctcccggcgctcctgaacagttt 
agaaagctcttcccaccgacgaaatgggctgcggagttcaatgcagctcttacggtgcct 
ttctttcactggctggttggtccttctcaggtcatagaagtggaagtgaatggtgtgaaa 
cagagaagtggggttcgtataaagaaatgcagtacttgcctatttactgttgtagatttt 
gaagat at gagt t gcgagat gat at acggacaagt gcct cct acat 1 1 gaagaagat cca 
gccacaaaacaaccttgtttagcagacatatgttcttacacatccaaaccaaatccacca 
gaaccttaa 
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{SEQ ID NO: 74) 

gene complement (join (226644- .226649,227468 . .227721, 

227729. .228119,228766. .228805) ) 
/gene= M Pept ide_3 8 " 

/note« w GenScan; Pl=Prom, P2»0.923, P3=0.400, P4=PlyA" 
peptide_38 1 214_aa 
MDNVIAKKFTAMFNHRRLNLSRSSLFTAS^ 
KTVVPYLPLRSEREQSNYTFTVKKDNNLHVLDVF 

MTWISPAVMFERFMRAITFQFETLSSSKSGVIQNLKGSDQIKILTNSETISLPVNDPRKI 
I HNTKPRDS RAVGNL YRNNVKEEMYASDFCCCQV 
(SEQ ID NO: 75) 

CDS CDS_38|645_bp 

atggataacgtcatcgccaagaaattcacggcgatgttcaatcaccggcggttaaacctc 

tcacgatcatcactcttcacagcatctgcagcctcagtcatatctctcatcgttttcacg 

atcttcatcgtctctcatgtactggttagagacttcacagaggttgtgacgatagagatc 

aagacagttgttccttacttacctctgaggtcagagagagagcaaagtaactatacgttc 

acagtcaagaaggataacaatcttcacgttctcgacgtttttggaggcagagacgtgtcg 

gggaagtttcagcagagagtaacagagtttctgagagaagattgcgaagtctacttcatg 

atgacgtggatctctcccgcggttatgttcgagaggttcatgcgagccatcacatttcaa 

ttcgaaactctgtcgtcctcaaaaagcggagtaattcaaaatctgaaaggaagtgatcag 

atcaaaattcttacaaacagtgagactatttccttacctgtgaacgacccgaggaagata 

atacataatacaaaacctcgagattcccgagccgtagggaatttgtatcggaacaacgta 

aaggaggagatgtacgcaagcgacttctgttgctgtcaagtatga 

(SEQ ID NO: 76) 

gene join(229427. .229466,229533, .229550,229588. .229880, 

232896. .232914,233006. .233230,234577. .234582) 
/ gene= " Pept ide_3 9 " 

/note="GenScan; Pl=Prom, P2=0.588, P3=0.266, P4=0.420, 
P5=0.513, P6=PlyA ,! 
peptide_39 1 184_aa 

MVSRNKFAHNFTEKSRFSDYGLYRSVAKFLEKAFLEQREKGEEENASLLQICRGTTQTES 
ARRFYETLDWSGFSVRFNWGFIGFQVLETIiLIDLRW 

DGVFGPQPMGVPPAQQMSRFDQPAPPVGYPPASYPPAQGYPPAPYPPAQGYPPASYPPPG 
YPQH 

(SEQ ID NO: 77) 

CDS CDSJ3 9 | 555_bp 

atggtcagcaggaacaagtttgcacataatttcacagaaaaatccagattttctgattat 
ggattatacaggagtgtggcaaaattcctggagaaagcgtttctggagcaaagggagaaa 
ggtgaagaagaaaatgcgagtttgttacagatttgcagaggaacaactcagacggaaagc 
gcaagacgcttctacgagactttggatgtcacatctggtttctctgtccggttcaacttt 
gttggatttatcggttttcaagtattggaaactctcttaattgatttgagatgggcacta 
acgtgttctagggtctgcgcgtgtatgcagactcaacacaagcttgaaatggacaaaaga 
gatggagtgtttggtcctcagccaatgggagttccaccggcacagcagatgtcccgtttt 
gaccaacctgcccctccagtcggctaccctcctgcgtcttacccaccggctcaaggctac 
cctcctgcaccttacccaccggctcaaggctaccctcctgcatcttatccgcctcctggt 
. . tatccccaacattga 
(SEQ ID NO:78) 

gene complement (join (235142 . .235147,236154 . .236602, 

241938 . . 242031, 242357 . . 242443 , 242643 . . 242747 , 
242822. .242884,243333. .243538,243887. .244712, 
248084 . . 248259, 251494 . . 251614, 253629 . . 253668) ) 
/gene* n Pept ide_4 0 n 

/note= n GenScan; Pl=Prom, P2=0.369, P3=0.281, P4=0.229, 
P5-0.841, P6=0.843, P7=0.996, P8=0.954, P9o0.712, 
P10«0.019, Pll=PlyA" 
pept ide_4 0 1 6 8 8_aa 
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MFNAWIHREYVDNEDDYCLLI HS PTI S F YRPEQYKLTTEPKNYVFS FGEKLCF S VLMGNC 
VFDFDRKFGFSVLAE I AFFGFGGKLYFS ILAGNYI FPVLRGFQPQDTDVI I AS YPKSGTT 
WLKAL WALLERS KNHSSDHPLLYHNPHGI I PFLEIDVYHESSSPNLAKFSAPPRLFSTH 
MPLHTIHEALKHSPCKIVYVCR23VKDTLISCWFYSCAIYKIEPTRRVLESMFNEFCT 
YFGPFTOHLLSYWRGSLEDPKHVLFMRYEEMKAEPRDQIKRLADFLGCPFTKQEEDSGSV 
DGILDLCSLRNLSSLEANKTGTINNVEHKPFFRKGEVGDSKNYLTSEMEKDRHDHQROS 
RFWFEFLEFLEFYYYVEQNICFTFGSGLLDFDPSEVRREIVFEFSSSSHDFDIIERGSFE 
SKMHDF YGDDGGDSETS I SVSVI ENMKEEYGLFVWPCSVI LAEYVWQERSRFRGSSVLEL 
GAGTSLPGLVAAKVGANVTLTDDSSKTEVMGLTWGVWDAT^ 

LIRKLISTLFWFIGLPDFLEPEPVTSSWPDPPPNSTTSSHHDSHLFSAALLAGQILPVVR 
FSDLNRPESECCAVCLYDFENDDEIRRLTNCRHIFHKECLDRWIMDYSQ^f^CPLCRTQFV 
PDELQTDFSQKLWSESSEDVSELLAQSS 
(SEQ ID NO: 79) 

CDS CDS_40|2067_bp 

atgttcaacgcttggattcatcgagagtatgtcgataacgaggacgattattgcttactg 

atacatagcccaactatcagtttttacaggcccgaacaatacaaattaacaactgagccc 

aaaaattacgtttttagttttggcgagaaattatgtttctcggtattgatgggaaattgc 

gtttttgattttgacaggaaattcggtttttcggttttggcagaaattgcatttttcggt 

tttggtggaaaattgtatttttcgatcttggcaggaaattacatttttccggttttgaga 

ggttttcaaccgcaagacactgatgtaatcattgcttcgtaccccaaatcaggcactact 

tggctcaaggccctcacagtcgctctgcttgagagatcaaagaaccactcttctgatcat 

cctctcctatatcataatcctcatggcattataccattcttggagatcgatgtgtaccac 

gaaagctcaagtcctaacctagccaagttctcagcacctccgaggctgttctcgactcac 

atgccactgcacacgatccacgaagcactcaagcactctccttgcaagattgtgtacgtg 

tgcaggaacgtgaaggacacgttgatctcgtgttggttttacagctgtgctatatataaa 

atcgaaccaaccagaagagttctcgagtctatgtttaacgagttctgcgatggaaccaac 

tattttggacctttttgggatcatctcttgagttactggagaggaagcttggaagaccca 

aagcatgtccttttcatgaggtatgaggagatgaaagccgagcctcgtgatcagatcaag 

agacttgcggacttcttgggatgtccttttactaagcaagaagaagatagtggatctgtg 

gacgggatcttggacctctgctctctgcgtaatctgagcagtttggaggctaacaaaaca 

gggacaataaacaatgtggagcacaagttttttttccgtaaaggagaagtcggtgactcg 

aaaaattatcttacgtctgaaatggagaaagatagacatgatcatcaaagaaaaacttca 

aggttctggtttgagtttttagagtttttagagttttattattatgtcgaacagaacatt 

tgttttacatttggctctggcttgctagattttgacccatcagaagtgagaagagagata 

gtgttcgaattcagcagcagttcccacgacttcgacattattgagcgtgggagctttgaa 

tctaagatgcatgatttctatggcgatgatggaggagattccgaaacttccatctctgta 

tctgttatcgagaatatgaaggaagagtatggtttgttcgtttggccttgtagcgtcatc 

ctcgccgagtacgtctggcaagagcgatctcgatttcgtggctcttcagttctcgagcta 

ggagctggcacttctttacccggtttagtagctgctaaggttggagctaatgtcaccctt 

accgatgactcaagcaaaacagaggtaatgggtctcacttggggagtgtgggatgcaacc 

atatttgatctgcggcctaacattatacttggagcagatgttctatatgattcaagtggt 

ttgatacgaaaactaatctccacacttttctggttcattggtttacccgatttcttggaa 

cccgaaccggttacatcttcatggcccgacccaccaccaaactccaccacctcaagccac 

catgactctcacttgttttcagcagcgctgctagctggacagatcttgcccgttgtcaga 

ttctcggatctaaaccgacccgaatccgaatgttgtgcagtgtgtctctacgacttcgag 

aacgacgatgagatccgacggctgacgaattgcaggcatattttccataaagaatgcttg 

gaccgttggattatggattatagtcagatgacgtgtccgctttgtcgtacccagtttgta 

cctgatgagcttcaaacggactttagtcaaaagctttggtcggaatctagtgaagacgtt 

tctgaacttcttgctcaatcatcttag 

(SEQ ID NO: 80) 

gene join (254094 . .254133 , 254399 . . 257236 , 257535 . . 257678 , 

257835. .258083,258167. .258907,259556. .259561) 
/gene« !, Peptide_41 » 

/note=»GenScan; Pl=Prom, P2«0.350, P3=0.377, P4=0.685, 
P5=0.683, P6=PlyA" 
peptide_41 | 1323_aa 

MSKQIDELRSSQNQQTEELGSKINALEALIEKYFANAPPPQRDGKQTDASSDITDGTPQA 
KAPPDRSNPENSSFKPHDNNNPPIHHSLSARLTKIGFPMFDGSELREWTYGCEQFFSIDS 
TPPELKVRLASLHMTGKALQWHHS YLANRYNI FSLWPEYVAAI SDRFSELYDDPLAELVS 
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LKQGNDTIDVYLDKFDCAMTRITLAPDHALS I FLTNMNQHLALHVRQFKVSTVPEAAKIA 

KLHELSLSHMPTKTSRPPFNSSQRSNYSQPNKSQNHNSTSPTTTAN^ 

LSFDEMQERKRKGLCMFCEEPFTPGHHLKHKRAEFLFIjDLDAETEFDDEIALVEQIRETT 

ISDDDDKVPTISVHALNGAPTFNCMRLVRKYEKRK^ILIDPGSTHNFLDIQMAKGLGCS 

LTPIKPMSVVA7^GDLVTKYKCSSFAWKMQGYGFTAEIRTLPLGCSDLVIiGVQWLSTLGP 

ILWDFLNLRMEFKFNELKHVIiRGI S PNSSKLI SGSS FNKLMLQDPQLALLHLREIDETTE 

QEPLEPETIFCHIEASETENDNSGSLERLLDSYTDVFDEPSTLPPYRAGFNHKIPLEAGS 

NPVNLRPYRYSSIQKDSIDKMIQDMLSQGIIQYSASPYASPIVLVKKKDGSWRLCVDYRG 

LNKQTI KDKYPI PLLEDLLDELGGSKYFSKLDLRAGFHQLCMS PEDVHKTAFKTHSGHYE 

YLVMPFGLTNAPCTFQGLMNHVFAPVLRKFLLVFFDDI^ 

HQQLYLKKSKCTFGGTRIEYLGHFISHDGVSTDPTKIKAVEEWPQPKHQKHLRSFLGL^ 
YYRRFIQGYS 1 1 ARPLTIMLRKDGFAWNTEASDAFHLLKQALI SAPVLALPDFSKTFIVE 
TDASNTGI GAI LMQDNHPVCYI SRALGPRHQGLSVYEKELLAVVHAI I TDLQNKPGSHAA 
YSFVNGELRRRGKLWGNDPAI KLHI FKWLHDSAPLPVPTGVWESVSLDFIEGLPPSSGK 
HCILWIDRLSKNAHFLALSHPYTAmVAKLYMDQVFRLHGMPKDITS 
KTLETYLRCMTSDSPSTWSAWLPLAEWWYNTTYHTAIRSSPFEIIYGQPPPVHL 
STSTTVDRSLQRREEL IDMMKFHLLRAQNRMKQYADSHRSERAFHI GDYVYLKLQPYRQH 
SLKGRHLPHKLSPRFYGPYEIQDRVGNLAYKLRLPFEAAIHNVFHVSQLKLGPNPPATPS 
SLPQYLKDVGTAKEPEKI LETKMVNRRNRAVTKVLVQWKGYS PEQATWEFYQDFVAKHPD 
FNT 

(SEQ ID NO: 81) 

CDS CDS_4l|3972_bp 

atgagcaagcaaatagacgagctacgttcttcgcaaaaccagcaaactgaagaactcggt 

agcaaaatcaacgcactcgaagcactcatcgaaaagtacttcgctaacgctccacccccg 

caacgcgacggtaaacaaacagacgcaagttctgatattacggatggaacaccgcaagct 

aaggctccaccagaccgttccaatccagagaacagttcctttaaacctcacgacaacaat 

aacccacccatccatcatagcctatccgcaaggctaacaaagattggctttccaatgttt 

gacggctccgaactacgagaatggacctacggctgtgaacagttcttctccatcgacagc 

accccaccggaattgaaggttcgtcttgcatctcttcatatgacgggaaaagcactacaa 

tggcatcactcttaccttgccaatcgatacaacatcttttcattatggccagaatatgtt 

gctgcgatctccgatcgtttcagtgagctttacgacgatccattagcagagttggtaagc 

ttgaaacaaggaaacgataccatcgatgtgtatctagataaatttgattgcgccatgacc 

agaatcacgcttgcgccggatcacgcattgagtatattcttgacaaacatgaatcaacat 

ctagctcttcacgtgcgccaattcaaggtcagtacagtacctgaagccgcaaagatagcc 

aaactacacgagctctccctctcacatatgccaacaaagacatcgcgccccccattcaac 

tcttctcaacgatcaaactactcccaacccaataaaagccaaaaccacaactccacctcc 

cctactactaccgccaatccaaacaacaaacccctcattgcgaatgctcctcaaaaatgg 

ctttcttttgacgagatgcaggagcgcaaacgtaaaggattatgcatgttctgtgaagaa 

ccgtttacaccaggtcaccaccttaagcataagcgcgctgaatttttgttcttggactta 

gacgctgagacagaatttgacgacgagattgcactagtggagcaaatccgtgagacaacc 

ataagcgacgatgatgacaaagttccaactatctccgtccacgccctcaacggtgcgcca 

actttcaactgtatgcgcctcgtcaggaaatacgagaaacgtaaactacatatactgatt 

gatccagggagcacacacaacttcttggacatccagatggctaagggattaggttgttct 

ttgacaccaatcaaaccaatgtcagttgttgcagcaagtggcgacttggttactaagtac 

aagtgcagctcttttgcttggaagatgcaaggctacgggttcacagctgagattcgaacc 

ttaccactaggatgcagtgatctcgtcctgggggttcaatggctttccaccttaggacca 

atcctatgggatttcctaaaccttcgtatggagttcaaattcaacgagctaaaacatgtc. 

ttacgtggaatatcacccaacagctcaaaattaatttctggaagcagctttaacaaactg 

atgttgcaagatccacagcttgctctactccatcttcgcgagattgacgaaactacagag 

caagaaccccttgaaccggaaacaattttctgtcatattgaagcgagtgaaacagagaac 

gacaattccggttcactcgagagactccttgattcatacacggacgtatttgatgagccg 

tcgaccttacctccttatcgcgcgggtttcaatcacaaaataccattggaagcaggatca 

aatcctgtgaatctccgaccatatcgatactcttcgatacaaaaggattcaatagacaag 

atgattcaagacatgctttctcaaggtatcattcagtacagtgcaagtccctacgcttca 

cctattgtacttgtgaaaaagaaagacgggtcttggcggctttgtgtcgactacagaggt 

ctcaataagcaaacgatcaaagacaaataccccatcccactactcgaagatctccttgat 

gaattgggcggctcaaagtacttctctaaactggatttacgtgcaggtttccaccagctt 

tgtatgtctccagaagatgtgcacaagacagcttttaaaacacattcaggccactacgaa 

tatttggtaatgccattcggcctcacaaacgcaccttgcacgtttcaggggcttatgaat 

cacgtattcgcacccgttctacgaaagtttctccttgttttcttcgacgatatattaatc 
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tacagcaagacctgggaggagcacctggatcacttggacaaggttcttgctatacttcgc 
catcaacaactctatctcaagaaatcaaagtgtacttttggaggaacgagaatcgaatac 
cttggtcactttatttcccatgatggcgttagtactgacccaaccaagataaaggcagtc 
gaggaatggccacaaccgaagcaccagaaacacc t ccgcagct t ct t aggt ct agccaat 
tactatcgaagattcatacaagggtacagcattatcgctcgaccccttaccatcatgctt 
cgcaaagacggttttgcttggaatacagaggcttcagacgcgttccatctcctcaaacaa 
gcattaatctcggccccggtccttgcactccccgatttctccaagactttcatcgtcgaa 
accgacgcttccaacactggcataggcgcaattcttatgcaggacaatcacccagtgtgc 
tacataagtcgtgcattagggcctcgacaccaaggcctttctgtttacgagaaggaactc 
cttgctgtggtccacgcaatcattaccgacctgcaaaacaaacctgggtctcacgccgcg 
tactcctttgtcaacggagaacttcgccgccggggtaaacttgtggttggtaacgaccct 
gctatcaaacttcatatcttcaagtggcttcatgattctgctccgttaccggttcctacg 
ggtgtatgggagtctgtgagccttgactttatcgagggtctaccaccatcatcaggaaaa 
cattgcattctcgtcgttattgatcgtttaagcaagaatgctcactttctcgccttgtct 
catccgtacactgcaatggacgtggcaaagctatatatggatcaggtctttcgtcttcat 
gggatgcctaaagacattacaagcgatcgtgacccaacgtttctcagcgaggttactaac 
aaaactttggaaacctatctccgatgcatgacttctgattcaccttctacatggagcgcg 
tggttacccttagcagaatggtggtataatactacttaccacactgcgattcgcagttca 
ccgttcgagatcatctatggccagccacctccagtgcacttgccttaccttcccggcgaa 
agtacttctaccacggttgacaggtccttacaacgaagggaagagcfccattgacatgatg 
aagttccacctcttgcgagctcagaacaggatgaaacaatacgcagactcgcaccgttct 
gaacgtgcgtttcatattggtgactatgtctatttgaagctccagccgtatcgacaacat 
tccctcaaaggacgacatttgcctcacaagctttcaccacgtttttatggtccgtacgag 
atacaagatcgtgttggtaacttggcatataagctacgtcttccatttgaagcagctatc 
cataatgtcttccatgttagtcagctaaagctcggccccaacccgcctgcaactccttcg 
tcgcttccccaatacctcaaagatgtcggcactgcaaaggaaccagaaaagattcttgaa 
acaaaaatggtgaatcgccgaaacagagctgtcacgaaggtactagtacagtggaaaggc 
tactctccggaacaagccacatgggagttctatcaagacttcgtcgcgaaacatcctgat 
ttcaatacttga 
(SEQ ID NO: 82) 

gene join (259921. .259960,260542. .260668,260773. .260900, 

261168. .261305,261389. .261454,261590. .261702, 
261789. .261899,262016. .262053,262136. .262244, 
262377. .262407,263398. .263535,263662. .263667) 
/gene« n Peptide^ 42 " 

/note= n GenScan; Pl=Prom, P2=0.999, P3=0.938, P4=0.995, 
P5»0.946, P6«0.803, P7=0.899, P8«0.700, P9=*0.950, 
P10=0.623, Pll=0.924, P12=PlyA" 
pep t i de_4 2 1 3 3 2_aa 

MIHWGGVTCCLSAAALYLLGRSSGRDAEVLKTVTRVNQLKEL^ 

SGRVGSDTPIKCEHSGIRGVIVEETDIX5TSRVNVVGARGATGFALTVGSEVPEESGRSLV 
RGTLDYLQGLKMLGVKRI ERVLPTGMPLTI VGEAVKDDI GDLRI QKPERGPP YVS PKSLD 
QLISNLGKWSRLYKYASMGLTWGVFLITKHVIDFLLERRQRR^^ 
GSNG^ESVSDSTK3raGAVPDLCVICLEQ^^^ 
TSKGGRLELRTSETNLVDPDS ETE 1 1 SMTDRS 
(SEQ ID NO: 83) 

CDS CDS_42|999_bp 

atgattcattggggtggagtaacctgctgcctcagcgccgccgctctttatcttctcggc 
cggagtagtggcagggacgctgaagtactcaaaaccgtcactagggttaaccaactcaag 
gagctagcgcaattgctagaattagatagcagcaagctccttcctttcatcgtagccgtt 
tcaggaagagttggctctgacactcctatcaagtgcgagcatagtggcatacgcggcgtt 
atcgtcgaggaaacggacgatgggacaagtcgtgtgaatgtagtgggagctcgtggtgca 
acaggttttgctttgaccgtcggaagtgaagtttttgaagagtcagggcggtctcttgta 
cggggaacacttgattatctccaaggccttaagatgcttggagttaagcgcattgagcgt 
gttcttcctactggaatgcctctcacaattgttggtgaggctgtcaaggacgatattggg 
gacctaaggattcagaaacctgaaagagggcctttctacgtctctcctaaatcactcgat 
cagctcatttctaatctggggaaatggtcaaggttgtacaagtatgcctccatgggttta 
actgttttcggtgtgtttctaattacaaagcatgtcattgattttcttctagagagaaga 
cagcggcgagaaatacagaaaagagtgcttgatgcagcagctaagagagctgggactgaa 
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ggttcaaacggcgcacatgagagcgtatcagattctaccaagaatgaaggcgctgttcct 

gatctctgtgtgatctgccttgagcagaactacaatgctgtgtttgtcccctgtccactt 

tgtcggagacgtatagatcaggttatcaagttggatatcggccatatcgacccatcgatc 

acatctaagggcggtcggttagaacttagaacctcggaaaccaatctggtcgacccagat 

tctgagacagagataatcagtatgacagatagatcctaa 

(SEQ ID NO: 84) 

gene complement (join (263813 . .263818,263901. .264080, 

264138. .268481,268603. .268619,268932 .. 270282, 
270383. .270724,270866. .271945)) 
/gene«"Peptide 43 w 

/note="GenScan7 Pl=0.999, P2=0.973, P3=0.024, P4=0.030, 
P5=0.016, P6=0.564, P7=PlyA M 
peptide_43 | 2437_aa 

MIAKI AKDVLQKLNATPCRDFEGI V6LDAHLKE IESLLDLDYDGVKMV^ TGPAGIGKTT 
IARALHSLiLSRRFQLTCFVDNIjREI^ 

VKENLCDQKVLIILDDWSLKQLEALANESAWFGPGSRIVVTTENLEILQQHGI^ 

GFPFDEEAIEILCRCAFRLRSPLYGFEVIiCTRIIRLCGNLPLALC^ 

DVVNRLETILDQGIEDVLKVGYESLEENEQTLFLHIAWFNNEDSDLVKAMFSDGTIDVK 

RGLQILVYRSLI EISTYGNI I EMHKLLRRVGRQAIHKQEPWKRQILLDAHEICDVLENET 

GTRAVSGISFDISGI^LIVDERAFKRMYNLQFLKVYKSRDDKLKFDILAGEKSRDDMND 

VIHIPEEMKFPRRLRFLHWNAYPNKCLPPSFHPEYLVEIjDMKHGKFEHLWK 

KLDLSFSCQLKELPDLSRATNLKILDMSYCKSLTO 

PDHLNJ^LGIVEIWGCSKLRKFLFISTTC 

GLTHLHKSLICLDLRYSDIETIPDCIKDLHQL^ 

CESLETVFCPLNSSPNAVLDFTNCFIOjGQQARREIIHRSLSCEWILPGKKVPAKFDHRA 
RGSSLTIRI PYGNNPLSAVS ILKLCVWS PNHQISKKSRHSEYLVCRCI SKGDLDPWEE 
FWSWSRYRSEHXiLIFHPRLPFIVPSETV'SRETVFEFSSKSHEFDVIECGAKVFEDKSIQ 
GSYESGSDQVSEDHIDYLTDGRYESCEDDINSLIDESLELNPANPLQATPYESSEEREWS 
EDEYELSEQYRQSRRQFRAQCRRRRSSIFEKSPFLGTETMVETRLQERSLTEQVDELRSL 
HDLLAAELKSRSDSLDARFDRLEALMFSANASPLHATGKAPLDPGPSHPPTPFNLGSSQP 
PDPPDI^GFRQHNEFGRLTSRLSKISFPKFDGTDLRDWLSKCEQFFDIDGTSQELKVRLA 
AMHLTGRATQWHTNYMSTRFGMF PSWTDYI IAI SARFCELFDDPLAELVALKQGSDLVAA 
YLDKFETTRMRLVLPEAHALSIFLANMNPHLSLHTRQFETTSIAGAAKIAMLHESSLSHT 
PNRQRAPFNPYPNTKPYQKPNNPSPLLPLTQAPTNQKPSFIPRNPTDKPPRKFSYQEMQD 
RRSKGLCMFCDEPFTPGHQLKHKRSQIYVMEODDADTIPDDSSSDAEPESDDKQATVEVT 
PVI S INALNGSTS YNCMRLIGHHGKHKLHI LVDPGSTHNFVDLNI ATQLGCELE PTRPMS 
VKAATGDTLLTNFKCS AFTWTVQGS S FTTE I RTVPLD CCDFVLGVQWLCTLGPI LWDFLN 
LRJ^FTLSGTKHVLRGVVKI^KVIKGSSLNKLMLQEP 

EMLYSHI SASGTTNADDPAIiQQLLQTFEDI FQEPKSLPPFREGFDHQI PLLAGSNPVNLR 
P YRYS S LQKDT I DTM I KEMLTQG 1 1 QHS AS P YAS P I VL VKKKDGT WRLC VD YRGLNKQT I 
KDKYLI PLLEDLLDELGGAKYFSKLDLRAGFHQLRMSEDDVYKTAFKTHQGHYEYLVMPF 
GLTNAPCTFQSLMNHWQDLSRKSVLVFFDDILVYSKTWEEHI^HIjAEVFLILQQQQLYL 
KLSKCTIGATI IEYLGHF I SADGVSTDPRKI AVIRDWPI PTTQKHLRSFLGLANYYRRFI 
KGYSSIARPLSTLLKKDGFTWSLEADQSFSDLKAALSSTPVLALPDFDQPFIVETD^ 
GIGAVIiMQGKHPICFISRSLGPRHQNLSVYEKELMALVHAVQTWHPYLAHRPFIIOT 
SLKYLMEQKITTPFQHMWLSKLMG 

TGFYDSLKLLWETDATLQKIICDLKSNSSSHPLFTYTNGELRRRGKIiVVGNDKDIKIiHIF 
KWLHDSAI GGHSGRDATLHRIRSLF YWPKLNVEVQNYIRNCS I CQQNKYDMAAKPGLLQP 
LPIPDGIWESISLDFIEGLPPSQGKHCIMVVVDRMSKNMIFIPLSHPYTALTVAQAFLDN 
I FKLHGMPKDVI SDRDP I F I SEVWNELFRVQGVTLKRSTAYHPQTDGQTEVTNKTLETYL 
RCMAAETPTSWSKMjSLAEWWYNTTFHSAIQATPYEVIYGQPPPLHLPYLPGESSSVVVD 
RSLQKREEVINMLKFHLLRAQNRMRQYADAKRSQREFKIGDYVYLKLQPYRQHTVKK^ 
PHKLSPRFYGPFRVLDRIGKVAYKLSLPTEAAIHDTFHYl^ 
QNEAATKVL VHWKDES PEMATWEFYKDFTNKYPLFNP 
(SEQ ID NO: 85) 

CDS CDS_43 | 73 14_bp 

atgattgcgaagattgcaaaagatgttttacaaaaactgaatgcaacaccgtgtagagat 
tttgaaggcattgtgggacttgatgctcatctaaaggaaatagagtctttgttagatctg 
gattacgatggagttaagatggttgccatcactggtcctgcagggatcggtaagactacc 
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atagctcgagctttacatagtctactctcccgtaggttccagcttacttgttttgtggac 
aaccttagggaaatctgtcctagtggtcttgacgaatatggttggaagaaaaggttacaa 
gaaaattttctttcaaaggttttgaaccacaaggacatgaggatatgccatttaggtgca 
gtaaaagaaaacctatgcgaccagaaagtgcttatcattcttgatgacgtgaacagtcta 
aagcaattagaggcgttggctaatgagagtgcatggtttggtcctgggagtaggattgta 
gtgaccacagaaaacctagagattttgcagcaacatggtatcggtagcacttaccatgtg 
gggtttccatttgatgaagaagctatcgagatcttatgtagatgtgcttttagactaagg 
tctccactttatggttttgaggtgctttgtacaaggataataaggctttgtggtaatctt 
ccattggctctatgtgtggtgggtaaatctttacgtgggaagaagaaggacgaatgggaa 
gatgtagtgaacagactggaaactattcttgatcaaggtatagaggacgtgctaaaagtg 
ggctatgaaagtttagaggagaatgagcaaaccctatttctccacattgcagtcttcttc 
aacaatgaagatagtgatcttgtgaaagccatgttctctgacggtacaatagatgtcaaa 
cgcggtttgcagatcctagtctatagatctctcatagagatatctacatatgggaatata 
atcgagatgcacaagctactacgacgagtgggtagacaagccattcataaacaagagcct 
tggaaacgccaaatcttactggatgcgcatgagatttgcgatgtcctcgaaaacgaaaca 
ggaactagagctgtgtccggcatatcatttgatatatcaggaatcaacgacctgattgtt 
gacgaaagagcttttaaaagaatgtataatcttcagtttctcaaagtctacaaaagcaga 
gatgacaaactaaaatttgacattcttgctggggaaaaaagcagagatgatatgaatgat 
gtaatacatatacctgaggagatgaagtttccacgccgtctaaggtttctacattggaat 
gcataccctaacaagtgtcttcctccttcctttcatcctgaatatctcgtggaacttgat 
atgaagcatggcaaattcgagcacctctggaaaggaacccagcggcttgcaaatctcaag 
aagcttgatctgtcattttcatgccagttgaaggaactcccggatctttcaagagcaaca 
aatctgaagatattggatatgagttattgcaagagtttggtagagattccatcctctttt 
atgcatcttcagaaactagaattgtggtggatggatgattgcgtaaacctacaagtcatt 
ccagaccacctgaacttggcttctcttggaattgtcgaaatgaacggatgttcaaaattg 
agaaagtttctgtttatttcaactacttcaccgagattgagtatatgtggggactctttt 
cgacaacggccgccatcatctggtatgtactctcgtataaggaaaagtgggaaactcaag 
ggattaacacatctccacaagagtctaatatgtctagacctacggtattctgatattgag 
acgattccagattgcatcaaagatcttcatcagctagagcaactctcccttgatggatgt 
agaagactcgcatcattgccagagctccctggttcgatcaaatccttaaatgcggaagat 
tgcgaatcactggagaccgtgttttgccctttgaatagcagtccaaatgcagtactcgat 
ttcaccaactgcttcaaattaggccaacaagcacgaagagaaattatccaccgttccttg 
tcttgtgaatgggttatcttaccaggaaaaaaagtacctgccaagtttgatcaccgagcc 
agaggaagttcattgaccattagaattccctatggtaacaatcctctttctgctgtatct 
atattgaagctttgcgttgtggtgtcacctaaccatcaaatcagcaaaaaaagtagacat 
tcagaatacttagtgtgccgctgcataagcaaaggcgacctagaccccgttgttgaggag 
ttttgcgtaagctatgtctccagatatcgatctgaacatctacttatatttcaccctcgc 
ttgccattcatcgtcccctccgaagtcagcagagagacagtgttcgaattcagcagcaaa 
tctcacgaattcgacgttattgaatgtggtgccaaagtgtttgaagacaagagcatccaa 
gggagctatgaatctggatcagaccaagtgtctgaagaccacattgattacctcactgat 
gggaggtacgaatcatgtgaagacgatattaatagtctgatcgacgagagccttgagttg 
aacc cagcgaat cc 1 1 1 acaggcgacac cgt at gaat cgagtgaggagcgcgaat ggagt 
gaggacgaatacgaattgagtgaacagtaccgccaaagcagacgacaatttcgagctcag 
tgccgccgaagaagatctagcatcttcgagaagtcacctttcctcggtaccgaaaccatg 
gttgaaacacgactccaagagagatctctgactgagcaagttgacgagttacggtcctta 
cacgacctcctcgctgctgagcttaagtctcgttccgactccctcgacgctcgtttcgac 
agactcgaggccctgatgttctccgccaacgcctcaccgctccacgccaccggaaaagcc 
ccactagaccccggcccctcccaccctccgacacccttcaacctcggttcttcccaacca 
cctgaccccccagatctcaacggattccgtcaacacaacgagtttggcagactcacatct 
cgcctttccaaaatctcgttcccaaagttcgatggcactgacctccgcgactggctttcc 
aagtgtgagcagttctttgatatcgacggcacctcacaggagctgaaggtacgcttagct 
gcgatgcatctcaccggaagagctactcaatggcacacaaactacatgagcactaggttt 
ggcatgttcccctcttggacagactatataatcgccatatctgctcgtttttgtgaactc 
tttgacgaccctcttgctgagctagtggccctcaagcaaggctccgatttggttgctgcc 
tatctcgacaagtttgagacgacaaggatgcgccttgtccttcctgaggctcatgctctt 
agcattttcctggctaacatgaaccctcatctctctctccacacgagacagtttgaaaca 
acctccattgcgggcgctgcaaagattgctatgctccatgaatcctctctctcccatacc 
ccaaaccgacaaagagcccccttcaatccttacccaaacacaaaaccctaccaaaaacca 
aacaacccttcccccctcctacctctaacccaagcccccaccaaccaaaaaccctccttt 
attccccgaaaccccactgacaaaccacctcgaaagttctcctaccaggaaatgcaagac 
agaagatcgaaaggcttatgtatgttttgtgatgaaccatttactcctggccatcaacta 
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aaacacaagcgctctcagatctatgtcatggagtgtgatgatgctgataccatccctgac 

gatagctcctctgatgccgagccagagtcagacgacaagcaagcgactgttgaagtcaca 

ccagtcatctccatcaatgccttaaacggctccacctcctacaactgtatgcgcttgatt 

ggtcatcacggcaaacacaagctgcatatacttgtagaccctggaagtacccacaacttt 

gtggacctcaacatcgccactcaactgggttgtgagctcgaaccaacgcgcccaatgtca 

gtgaaagctgcaactggtgacacccttctcacaaacttcaagtgctcggctttcacttgg 

accgttcaaggctcctcctttacgacggaaatccgcactgtccctcttgactgctgcgat 

tttgtgcttggagtgcaatggctttgcactttgggacctattctttgggacttcctcaac 

ctcagaatggagttcacactctctggcaccaaacatgtacttcgtggcgtcgttaagaca 

ggaggcaaggttattaagggctccagcttgaacaagctcatgctccaagaacctcaaata 

gccttaatccagctacaacagattgatgagaatgccgacaaccagcaaagcctcaatcct 

gaaatgttgtactctcacatctctgcttcgggaacaacaaatgctgacgacccagctctc 

caacaacttcttcaaacgtttgaagatatctttcaagaacctaagtcactgccaccgttt 

agagagggttttgatcaccagataccactccttgcgggctccaaccccgtcaacttacgc 

ccgtatcggtactcttctcttcaaaaagatacaatcgatacgatgatcaaggaaatgctg 

actcaagggatcattcagcacagtgctagcccctacgcctcacccatcgtcttggtcaag 

aaaaaagatggcacgtggcgtctttgtgtggattacagagggctcaacaagcagacgatc 

aaggacaagtatctgatccctctgctagaagacctcctcgatgaacttggtggcgccaag 

tacttctctaagttggatcttcgagcaggttttcaccaactccgaatgtcggaggatgac 

gtctacaagacagcattcaagacacatcaagggcactatgaataccttgtcatgcctttc 

ggcttgacaaacgccccttgcactttccagagtcttatgaatcatgtatttcaagacctc 

tctcggaaatcagtgcttgtctttttcgacgacatccttgtatacagcaagacttgggag 

gaacacctacaacaccttgcggaggtcttcctgatcctgcaacaacaacagctgtatctc 

aaactctctaaatgtactattggtgcaacgatcatagagtaccttggccactttatttct 

gctgacggcgtgagcaccgatccccgaaagattgctgtaatacgtgattggccaatccct 

acgacacagaagcacctacggagcttcttgggattggcaaactattaccgccgcttcatc 

aaaggttatagctccatcgctcgccccctcagcacgctactgaaaaaagatggctttact 

tggtcccttgaagctgatcaatccttctcagacctcaaagcagcgctcagttccacgcct 

gtcttagccctccccgactttgatcaacctttcattgttgagacagacgcgtcaaacacc 

ggtattggtgcggtccttatgcagggaaaacaccctatctgcttcataagtcgctccctt 

ggtccccgccaccaaaacctgtctgtgtatgaaaaggagttgatggcattagtgcacgca 

gtgcaaacctggcacccctaccttgcgcatcgccctttcatcatcaacactgatcagcga 

agcctcaagtacctcatggagcagaaaataactacacctttccagcacatgtggctctct 

aagcftatgggctacaacttcgagatccactacaagcaagggaaagacaatgttgttgct 

gacgctctctctcgagtctctggctctcagttactgaacatggtactatctcaggctcat 

actggtttctacgattcgttgaagctactctgggaaaccgatgctactctccagaagatt 

atctgcgacttgaaatctaacagctcttcacaccccctcttcacatacacaaacggagag 

ctgagacgtcgaggcaagctagtcgttggaaacgataaagatatcaagttgcacatcttc 

aaatggcttcacgactctgccattgggggacactctggacgtgacgcaaccttacatcgt 

atcagatctttgttctattggcctaagctaaacgtggaggtccagaactacattcgaaac 

tgtagtatctgccaacagaacaagtacgatatggctgccaaaccgggtcttctacaacct 

ctaccaattcctgatgggatctgggaatctataagcctagacttcattgaaggtctcccg 

ccgtctcagggcaaacactgtattatggtagtggtcgataggatgagcaagaacgcacac 

tttatacccttgtcccacccatacacggccctcactgttgctcaggcattcttggacaac 

atcttcaaactccatggcatgcccaaagatgtcatcagtgacagagatccgatctttatc 

agtgaggtttggaacgagctcttccgtgtacaaggggttactctcaagcgctccactgct 

taccaccctcaaaccgacggtcaaacagaggtaaccaacaagaccttggagacctatcta 

cgctgtatggctgctgagacaccaacatcctggagcaaatggctcagtcttgcagaatgg 

tggtataacaccacttttcactctgccattcaagctactccatacgaggtcatctacggt 

caacccccaccacttcacctcccttaccttccgggtgaaagctcctccgttgttgtagac 

cgcagcctccagaagcgtgaggaagtgattaacatgcttaaatttcacttgcttcgcgct 

cagaaccgtatgcggcagtatgccgatgcaaagcgttcccagcgagagttcaagatcggt 

gattacgtatacttgaagctccagccatatcgccagcatacagtgaaaaagaataaggct 

ccacacaagctctcgccacgcttttatggtcccttccgtgtgctagaccgtatcggcaag 

gttgcctacaaactctcgctgcctactgaagctgcaattcatgacacatttcactactgg 

cttgacctcggcaacaccaaagaacctgcagcaattctggagactaagactgttaagcgc 

caaaacgaagctgcaacaaaagttttagtccactggaaagatgaatcacctgaaatggca 

acttgggagttctacaaggacttcaccaacaagtatcctcttttcaatccttga 

(SEQ ID NO: 86) 

BASE COUNT 89612 a 47329 c 46973 g 88076 t 
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ORIGIN 

Rf Gene Region 

1 tggttttcat cccactcttg ttaccttcaa cacccttctc cacggattat gtgtggaaga 
61 tagggtttct gaagccttgg atttgtttca tcaaatgtgt aaaccaaatg tcgtaacctt 
121 caccacgctg atgaacggtc tttgccgcga gggtagagtt gtcgaggccg tagctctgct 
181 tgatcggatg gtagaagatg gtctccagcc taaccagatt acttacggaa caattgtaga 
241 tgggatgtgt aagatgggag acactgtgtc tgcattgaat cttctgagga agatggagga 
301 gttgagccac atcaaaccgg atgtggtaat ctatagtgcc atcattgatg gcctttggaa 
361 agacggacgt cataccgatg ctcaaaatct tttcattgaa atgcaagaca agggaatctt 
421 tccagatata gttacctaca gctgtatgat taatggattt tgtagctctg gtaaatggag 
481 tgaagcccag cgcttgttgc aagaaatgtt agtaaggaag atcagccctg atgttgtaac 
541 tttcagtgga ttgatcaatg cattggtcaa agagggcgat cttaattctg ctcaagacct 
601 tttacaggag atgatttcta gtggtgtgtg ccctaatgtc gttacttgta acactttgct 
661 ggacggtctc tgcgatcgcg ggaaactaaa agatgcattg gaaatgttta aggctatgca 
721 gaagagtatg atggacattg atgctactca tgccttcaat ggtgtggaac ctgatgttca 
781 aacttacaat atattgatca gcggattgat taatgaaggg aagtttttag aggccgagga 
841 attatacgag gagatgcccc acaggggtat agtcccagat actgttacct atagctcaat 
901 gatcaatgga ttatgcaagc agagtcgcct agatgaggct acacaaatgt ttgattcgat 
961 gggtagcaag agcttctctc caaacatagt gacatttaac acactcatta ctggctactg 
1021 taaggcagga atggttgatg acgggctgga gcttttctgc gagatgggtc gaagagggat 
1081 agttgctaat gcaattactt acatcacttt gattcgtggt tttcgtaaag tgggtaatat 
1141 taatgggtct ctagacattt tccaggagat gatttcaagt ggtgtgtatc ctgataccat 
1201 tactatccgc aatatgctga ctggtttatg gagtaaagag gaactaaaaa gggcactggc 
1261 aatgcttgag gaactgcaga tgagtatggt atgtaagttt ctgttcagtc tatgttattt 
1321 ttaatatgaa gaagaatgta tacatgcttt tgtgtgtagc ttcagattga tgatacatgt 
1381 tctggaatta accatcggtt tggttttgca ttgtaggatc atcatttggt ggggtgaatg 
1441 atcaaagatt tttctactct ttgcagcaga gcttcaatgc attttgtttt tgttgctgca 
1501 tttgtaccct actaatgttt gatcaaatcg tggaatagag tgatcatagt gtaaaaattg 
1561 tgtggtcaat gagctgtttt gctgctattc taatgacagc ctttatgcgt ctattgtagt 
1621 ttaataaatt tgaccatttc caattaagtt ccatacactt gttcacgcaa gattattggt 
1681 gcgaaataaa gaagcacacc ttccagaaga cttcaggtgt taaaagatgt ttagtgttta 
1741 atctttcgat cccacttttt gtttgtttgt atgcaggtat attagttggt aacatggaaa 
1801 atattctcct ttattacctt gaagtagtga tattgtctgg tttagttaaa ctcgtattat 
1861 cctaactggt agtatccatg agattcaatt ggtatgtgcc tgtctacaac ataggagttc 
1921 ttctctatga tcattctatt tcaaaatatt tatatgtttc aggtgtgagg caggtcaatt 
1981 agtggcaagg aagcttatcc tcaacttggt tcataatttt ctgcagaaag aaacattggt 
2041 tctcagctca aaatatgtta tcatcaacca ggtaatcgta tttaagacaa tccaacacac 
2101 cctgcttcac atgcaccttt caaaatttta agagatacaa ggtgataacg aattttattt 
2161 ataccgagag agtagactac atgtcgaacg tgtttaatga taagcgcaaa cgagaaaaaa 
2221 acaaccttca agaacgagta atgtgctttg gcatgcactt attaagagaa tagattaatt 
2281 gtatacagaa gcoaaaatga tttgttcttg aagctacgaa tcaatctgta tgttcagatt 
2341 ctcgacgatc tcttctagct tgagttttcc ctttcctttc cctccacccc ctcccaggtg 
2401 ccttcatcaa aggtttctcc acatcgcctg agatatcaag atgaactcct ggatcatgag 
2461 tctcgtactt tgcgcccatc attatggagt tgttcacagc ttgaaagttg ctgtttacaa 
2521 aagtggtcag gaaatcaagc tcttgcccac cgtttttgta gctgtctcca tggttatcat 
2581 caagctcggt cttcatggtg gctccaaggt tggatccaga gagcgtgatg gctctgatgc 
2641 catcctcctc ctccttctcc tcttcttcat attggttccg tcctgccttg tctgctccca 
2701 tgtgggttaa ggaggagatc atgtttttaa gatcgtggtg gaaatgttgg tgttggttgt 
2761 ccttcgctga ttcgcctcct cttgatttgc ttagcttcat tctaagtttt gcacaaccag 
2821 aagagagtat gggaaggcaa gtaatgcaaa ccatacatct ctaaatatat atatttataa 
2881 atatcactat atcagagtaa acgcaatgcc ttgtgttgta agtaaaggaa ggggaatgac 
2941 tttggaatgg gtttaagaat aacctttgga gattgagaaa tgtggttagg tatcacgata 
3001 agatgttgca gttaaataat tggatgatgt caaaaccgta ggagattcag agttctttca 
3061 ttgtaataat aaaaacaaca cacttgagac catgtcgata gttctcttcc aacataaaaa 
3121 ccaattggtt ataaaagatc gtcatcacat catagatgag ttaaatcttg acccgaaggt 
3181 tgaaatcaag tcttcttttt ccagctagaa gctttgttag gacttgcata catataacaa 
3241 ttagatatgg ggaaagggaa cacccttgtc tatgacatct ttcgctatta acatacctag 
3301 tcaatttccc catttatcta aaccgagaaa gatgtcagct caatacactt tttttatcca 
3361 aacacagagg agacaactcg cattgctctg ttttccttca cacatgcatc tatgtagcaa 
3421 acaacaaaac tgaaatgaac tcttcctgat ccattcccaa atcctttatg gttttgacgt 
3481 gtattgaaac ctctgatact ccacaagtac atcaaattct ttttgttcaa aaatactagg 
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3541 acctttattc ttagactttt gaaaattttg 
3601 gacccctcta ggaagagatt gtctttgagt 
3661 ttctctgcct cttcaggaac atctacacaa 
3721 cagttgaact tagaacggag aatattgagt 
3781 ttccttggtg tgttttctat tcactcttgt 
3841 caatcacttc cacggaagca caagcctcag 
3901 acaacctcag aatcagtctc tgcgtttttt 
3961 tttagtttgg tgtttgcaag acgtaatttt 
4021 gtttcattat cacccttttc cttctcactc 
4081 tgtttctcgc tagtagctct acatgctttt 
4141 cctctacatt cacccaccaa taggaattag 
4201 gaaaccaaat aataagaatt taatgaaata 
4261 tagcaattat aaaaaaaaat tttaatattg 
4321 accctaaatt ctaaacccta aaccttaaat 
4381 aagcctgaac ccttgggcaa atcctatacc 
4441 tcataaacta aatcctaatc cctaaaccct 
4501 tcctaaattc caaaatggtt tatggtttag 
4561 aagagtttag ggtttagggg ttcgtgttta 
4621 ttaagggttt aggttttaag attaacgatt 
4681 ctttccccaa gggtttaggg tttagtattt 
4741 ggtttagtat tttctgacgg atttatcaat 
4801 ttttttattt atctaaaaat ataattttat 
4861 ttttaagaga tatcaaatct taattaacta 
4921 accttagggg gtgaagctaa gttttgttct 
4981 atgcttgaac cgatcgagat aatttttttc 
5041 tcatatatat tttttaaaat tctaaacccc 
5101 aatttaaatt agttaatctt atgtgtatag 
5161 ttatgatcct tttatactat atatttgtga 
5221 attgatcata cttttatgac tgaaaaataa 
5281 ccacgattag aaaacataat ctctccgttt 
5341 ttgttttaaa aaagtgtatt ttttatattt 
5401 aaattgtgtg ttttaaaaac attaattata 
5461 aggaaatata gaattacaaa aaactatgta 
5521 ataagtgtga aaatcctaaa atatctatta 
5581 tttgatcatt tttttttgtt tttttttttt 
5641 aaaaggaaaa gtaaaacaac aaaacagtta 
5701 agccggtagc caataacacg acggttaaaa 
5761 gacgtaattt acacgatcga gtaataagtt 
5821 gtagtaaaac aagtccatta ttatatcatc 
5881 cgcagacatg catcttcttt ccattttctt 
5941 aaacgaaatt aagagaacaa acacaaaaag 
6001 agatcaacaa aaacaaagta gactaagaaa 
6061 agtatccaaa aaaatatcat aacaatgaca 
6121 tctctgggct cctactttaa aacgtaccgt 
6181 ttgacgcttc tcctcttcct tctcgcagcc 
6241 agtaaggtat aattaattaa gaaaaataat 
6301 taatcttttt ttgttgatca aagacgacga 
6361 gaaaaccggt acccgagcta gaatcaccga 
6421 gtttattgaa tcaaaaccgg tctggttctt 
6481 taattattat cgagatcttt tcattacgta 
6541 tcaacttttt gttgtttacc acagacaaaa 
6601 aaaccggttc acgtaccagt accagtacca 
6661 tcggtagatt gcacaagttt cttgaaccaa 
6721 caacccggct ataataataa ccaaaccgaa 
6781 tggatccacg aggatctaaa gccatggaga 
6841 agaggacaaa cgacagcgca tttcaggtta 
6901 aactacaaga agtctataca gactagagac 
6961 ctgagaaagt atccagggaa gttgcctgac 
7021 cctgttatta gattggacgg ttacagtaaa 
7081 gcgttattta gatacggcgg agatagatgg 
7141 ttctggggat ggtatgtact agaatatatc 



attctgatag aaacttatgg tttatcctta 
aatcacctct ttctctttag tttatcttat 
gatatgtctc cactatttca ccttgctttt 
tcaatagtat ggttaggaca tctgctacct 
tccataaatt tggatcttct taagtttttg 
taacctcaca aaaaaaaaat ctccacttct 
tatgatactc gtgcttcctg tatccttttc 
ctcattaatg tttgttaccc actgagaaag 
aagtcttctt tctcaacttc cttttccaca 
atcaagaaca aaactttacc ctaaagtgaa 
ttaattgaga tttgatatct tttaaaaaat 
aaattatgtt tttagataaa taaaaattag 
ataaatccgt cagaaaatac taaaccctaa 
tctaaatact aaaccctaaa ccctaggaaa 
ctaaccctta aatttaaacc aaaccttaaa 
aaattataac ccctaatact aaaccctaaa 
gattaaggat ttatgattta atatttgtca 
gaatttaggg tttaatatta ttgtttatga 
taggatatag gatttgccaa aaggttcagg 
agaattttgg gtttagggtt tagggtttag 
attaaaaaaa aaaaattata attgctacta 
ttaattaaat ttttattact tggtttcctt 
atttctattg gtgggtgaat gtagagattc 
tttatcaaat gaccccagtt accgtatgtt 
taaaattgtt atttttaatt ttaaatctaa 
tgaaaaaaaa aactccaccc ccttttccaa 
acatctttta ccttgttagt atactagtca 
taaacaatca ttttttggct ttactagaat 
ataaaaacaa at at tat at a atgggctaca 
cataaatatg tatgttttag gaaaaaaata 
tcagtgcagt tcttatcaac aaataataaa 
tttttaaaaa ttttatttgt ttaaaaatat 
ttaataacta agttttcata tatttttttg 
ttttgaaaca aaatgagtat atttttcctt 
ttggttttct caaacataca gctggaaagc 
aagcctaaaa cccacaagtg ggctctgttt 
tgtcgtatat gattatcata atacgaaact 
tagtagatga aattccaaaa ttcaaagagt 
atcatcaatg tcgtgtcctt gtcaaactca 
ttcctgctta aattttcttt aacttcaaat 
agagactgga agcaagagag agagagagag 
agattcctta gtctccaaag attccccctt 
tctctcttct ccaaagcccg agctctcact 
ccggtcaccg gaaaactcca tgtggccacc 
gctgttgccg tcacctcttc tctatggctt 
cttaatataa cgttatcaag aaaattaaca 
aacaatttga tacaccgaca ttagtcacaa 
agaaaaccgg agtactggta aatggcacaa 
actcagaaac atctctatgg cttaacaagg 
attaaaattc taatactatt ttcagtaatt 
tcatataatc agccgacaat aataacaaca 
gagaagaaat caacgaagaa aaccggaatc 
aaccggtccg gttcttgctc gagaacagct 
tcgaaccggt catgtcctga ttacttcaag 
gagacgggga taacgagaga aatggtggag 
gttatagtaa acggcaaagt gttcgtcgaa 
gagttcacac tgtgggggat tcttcagctg 
gtggatctca tgttcgactg tgatgatcgg 
tctaatcata cagctgaaaa tgcaccacct 
acggcggatg tcgtctttcc agactggtca 
tctacacttt ccctcctttt cttcttttgt 
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7201 tttctctgtc tgaagcgtgt agggttgtca 
7261 gccgcaaaaa aagattcatt tttttgttta 
7321 agaattaaat gattgagtac tctccactca 
7381 gctaatttat ccctattttt cgttccaaaa 
7441 gattagctaa aagtaacttt aatttttata 
7501 tcaatcttta cagcaaaaca ttaagtaagt 
7561 aagtaagtgt tttgttccaa aatcttattg 
7621 tttgtatcaa ttgtcatatt tgtttttctg 
7681 taagttgaaa atcaaataaa ctgaaataaa 
7741 attaacataa agccatggag caaagtgttg 
7801 aaatttatgg agagagaagc ttatgcatat 
7861 tcgagagaag atcttcttac ttgcaatgta 
7921 ttcattcagg tatatatata tatatatata 
7981 tataacccta ttttatctat taatttttcc 
8041 aggtataata acaatgaatt tgtgagcaat 
8101 tcagaaggac aaaaaggatt tgagaattca 
8161 gtactagtct atacattttg cctcatatag 
8221 tatatatatg ttgattttaa ttattgagat 
8281 gaagggtatg gatggtcagt gagtgagaaa 
8341 atggtgaaac catattacta tgatttcttc 
8401 tggcccatta atgataagga taaatgtaga 
8461 aatcacactc aaaaggtatc attaccaatc 
8521 ttacatgatc aattttccct ccaccaaact 
8581 aaaaaagtag atcccctagt ttacaaaaag 
8641 attagtttca gttttttttt ttttgcgaaa 
8701 ttcattttat ctttcgaata taatctacaa 
8761 gtgattagat aaaatcttgc tatttttctt 
8821 gttgcgtaat actttgaaaa gttggatcgc 
8881 gttgcaatgc aggctcaaga gattggaagg 
8941 tcaatggaaa acgtgtatga ttacatgttc 
9001 aagttcaagc ctcaagttcc ccaaaacagt 
9061 ccttctggag atgctaatga tactaataag 
9121 cctcacaatt caggcccatg ttcgctacct 
9181 ttttatagga agaaactgaa tctcatccgg 
9241 gaaaacgttc aataaaatga ttgtattaaa 
9301 tatatcggat cacgttagaa taatgataat 
9361 cacagatggt ccacataata acatatgcag 
9421 acctaaatcg agcccattga aacctgctga 
9481 atatgtagta cattatatta tatctaccat 
9541 gcttcttatt ttattttagc acggctcatt 
9601 atacatgttt tgctatataa tagttcatgt 
9661 tgtgaggcta ctgcgatatg attgtagtgg 
9721 taaaagatat gggacagttg tagttaaaac 
9781 gacgaataca aattaagttc atctattcta 
9841 agttgtgtcc aactgatttt ttttataact 
9901 tgttctcttt gtataataag cagttacgtt 
9961 ggtttatgat tttaaaattg ggccatgggc 
10021 gttacatttg gtctgattgc aatggccata 
10081 aggtttatgg ttaatttaat tttaatagaa 
10141 taattttata taagattact attatatcta 
10201 tatctattag cataactgta tgtatatata 
10261 tgcattgcaa catttttaat aaactaaaaa 
10321 tcaatttgta ttcataaatt tatgattaga 
10381 tcagacatat ttacagaaaa aaatacaaaa 
10441 caagatctaa tataggtcta acattttacc 
10501 gaaaatacag gttcagatcc gagtctatgc 
10561 acccacaagt ttcggtttgg atctacgtcc 
10621 ctcaaaaatt attatatatt aggtaaattt 
10681 atattttttt aagtttcagg ttaaggtttt 
10741 tagatttcta aaaaatataa tttgggtgtt 
10801 tcggatctga ttttgggtaa gtttcagata 



acatgcaatt gtccccactt ttgatcacat 
tatatagtaa tttaaagggt tcgaaaaagt 
atgtcactac tagttaaata tgaggttgta 
aaagctaatt tatttatgct ttgaataagt 
tcgcgaacgg aactttaatt tttatataat 
gttttttttt ttgataaaag atttaacatt 
gtaaattcat attgaacttt gaaatattgt 
atgatttatt tgcatagttg aaaattttaa 
ttggattaaa ttggtggata aaggcaagag 
acagaaatgg aaaaaggaaa gaagaagaag 
tggaaaggga acccttttgt tgcatctcct 
tcctcacaac atgattggaa tgctagaatt 
tatatgcttc tataagctta tgttcgtacg 
atgaggtata ataacaatgt tgataaaaat 
attttctttg tgtttacaca ggattggata 
aatgtagcag atcaatgcac ttacaagtta 
taccaatgtc taatttgagt tgaggttttt 
aataagttgt gcaggtacaa gatatatata 
tacatattgg catgcgactc agttacattg 
tcaagaactc ttcaacctct ccaacactat 
tccatcaaat ttgctgttga ctggcttaat 
ttttttttat ctcagtctag ataatctttc 
acaccttgaa taggatttta gcttaagtaa 
taatttatgt atatatatag tactcgagta 
taattagttt taattttgta aggaaagttt 
aatagttgtc tatacctata agttaatact 
tattgttgaa ataaagtacc aaaaacacat 
ataattatgc atactttttg tttataattg 
ggagcaagtg agttcatgca acgagatcta 
catttgttga atgaatactc aaagcttctt 
gttgaaatct gcacagaagc aatggtgtgc 
agatttttga tgggctcttt agtcgatgag 
cctccttttg atcccaacgg tctcgagaag 
caagttgaga aatgggagca cgcttactgg 
ctgtttgata tgattttatt ttagctttta 
tttgagaaat atataattcc ttcatcagca 
cttcttagca tgagtagatc aatttcctca 
actaatagat cgtgttaata gcttttattt 
aattaaggtg aatgactgtg tgagcttgaa 
tacatgtgta ttttatttgt aaaagactgg 
atttgaatgt tcttgaagat taaaaaggca 
gaacttaatg accaatttca tgtgaatgct 
taaaaagttt gttttcctgc cttgacgaat 
ttaattcttc aacatgtcct attgatattt 
gtatctaata atttaattaa atcaataaca 
ttcttccaac actaagaaag agtttttgtc 
ttcgatttcg ttttaaaaat caactgattt 
taaatttata ttgctttctt aaccacattt 
gaagactaat aatattgatt attaaaacca 
taataattta tagatatttg tttactatta 
tatatatata tatatatatt atttatagtt 
attagtatat tttaatttta aaataaatct 
aagttaataa taaaatagta ataattaatt 
caacactata tatgcgcttc aagcacagat 
ggacccgaag aaccaaacta gaacggat cc 
taaaatattt attggatctt ttttatgggg 
tacaagagac ccaatcgggt attcgaaata 
ggatgatttt gtgtattttg gatatttcag 
taggtataat ttcaggtttc gggtaaattt 
c 99gtaaaat ttgagatacc ttttgggttt 
tattttctcc aggtattttt aggatttgca 
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10861 ggtacttttc gagttttcaa attcagtttg 
10921 atatgattac aaaaaataga tttataatta 
10981 aaattattat tatttataca gtaaatcaaa 
11041 tgactacaaa aaaatatgat taagaaaaaa 
11101 acatgtccta ttgatattta gttgtgtcca 
11161 tttaattaaa tcaataacat gttctctttg 
11221 ctaagaaaga gtttttgtcg gtttatgatt 
11281 tttaaaaatc aactgatttg ttacatttgg 
11341 tgctttctta accacattta ggtttatggt 
11401 atattgatta ttaaaaccat aattttatat 
11461 agatatttgt ttactattat atctattagc 
11521 atatatatta tttatagttt gcattgcaac 
11581 ttaattttaa aataaatctt caatttgtat 
11641 aaaatagtaa taattaattt cagacatatt 
11701 atgcgcttca agcacagatc aagatctaat 
11761 accaaactag aacggatccg aaaatacagg 
11821 ttggatcttt tttatgggga cccacaagtt 
11881 caatcgggta ttcgaaatac tcaaaaatta 
11941 tgtattttgg atatttcaga tattttttta 
12001 tcaggtttcg ggtaaatttt agatttctaa 
12061 tgagatacct ttcgggtttt cggatctgat 
12121 ggtattttta ggatttgcag gtacttttcg 
12181 gcctaatctg aatagataaa tatgattaca 
12241 gtgtatgccg gccaacttta aattattatt 
12301 taagatttat atacatatat gactacaaaa 
12361 tacaattaaa tatctagaaa aagttaaaca 
12421 tcagaatcta gattctgaag accgaatgtt 
12481 aaataaaaaa tcatataaac tttttaaaat 
12541 gggtctatag caaaaaacaa tagtttaaaa 
12601 taaaatcact gtgaaataat tcacaagttt 
12661 acggataaga attatttgga gtttttaaaa 
12721 ttcatacaat aattttagat aaatttcata 
12781 tacaaaatat tcatgacata aacttaattg 
12841 gggcgggtca gaatctagtt caatgttaat 
12901 ttatttttgt ttttaaaata attatgcgtt 
12961 ttttagtttt ctacacacat aacttgccat 
13021 aaaaaatata tatatattta cgatggcgat 
13081 tttgctcaaa aaaaatgtat gtgaatacat 
13141 ttttaaaatg atatatgttt taaagtaaaa 
13201 aatattttat tagatagtga attgtatcaa 
13261 ttttgattgg ataaaaaatt tgtggaaagt 
13321 aaaattttta atgtggatga aaattcaaaa 
13381 tagacgaaca tgacgtatat ttattgtgtc 
13441 cattattaat gcatatcata atgttagttt 
13501 tcacgtgagt attattggcg gtgaatattt 
13561 atagtatgta aatttgtaag acttgataaa 
13621 gaataaccta gtttgttggc aaataaatat 
13681 tgagtgattc aaatttacta ctccttccgt 
13741 tatttttaaa agatctacat tttacatttt 
13801 ttcgaaaatt taattttact tattgaattt 
13861 aagacaagat tattcaaatt taatgtgttt 
13921 taactttaga aaatagaaag aatatatcag 
13981 ttcaattgaa gcttaaccac tatgttcaat 
14041 gactggttca aacgcatcga ttgcggttac 
14101 tagtttcaag cgttatttag agttttgtat 
14161 ttgcgaaata tttgtaactg gttgattatg 
14221 caatataaac atattataat atataaaaat 
14281 tattattaat ataatatgat taataataat 
14341 ttgaaagtta taatttaatt taaaaattga 
14401 ttgtttcgtt ttatatatgt atatatcttt 
14461 atctattgaa tacctagttt aaagttttta 



gatatgtcta ggcctaatct gaatagataa 
taaagaaacg agtgtatgcc ggccaacttt 
tagattttga ttaagattta tatacatata 
aacacaagta tctattctat taattcttca 
actgattttt tttataactg tatctaataa 
tataataagc agttacgttt tcttccaaca 
ttaaaattgg gccatgggct tcgatttcgt 
tctgattgca atggccatat aaatttatat 
taatttaatt ttaatagaag aagactaata 
aagattacta ttatatctat aataatttat 
ataactgtat gtatatatat atatatatat 
atttttaata aactaaaaaa ttagtatatt 
tcataaattt atgattagaa agttaataat 
tacagaaaaa aatacaaaac aacactatat 
ataggtctaa cattttaccg gacccgaaga 
ttcagatccg agtctatgct aaaatattta 
tcggtttgga tctacgtcct acaagagacc 
ttatatatta ggtaaatttg gatgattttg 
agtttcaggt taaggttttt aggtataatt 
aaaatataat ttgggtgttc gggtaaaatt 
tttgggtaag tttcagatat attttctcca 
agttttcaaa ttcagtttgg atatgtctag 
aaaaatagat ttataattat aaagaaacga 
atttatacag taaatcaaat agattttgat 
aaatatgatt aagaaaaaaa acacaagtat 
aaaatataca tgtcattttg aaagggcggg 
ctcaaatatt aaaaaaaata gtttcattta 
acacaaaaac agattttatc aatacataac 
atattaaact ttttatttaa attaaaatac 
aacagcaggt taataaacaa aaatcataat 
ttttggtcat atttaaaata acaaaaataa 
aaattcttaa caaatataat ttatcaagga 
tttcaacaaa aaatatatcc gccctctgaa 
cttataagca tgaataatta tttgataaat 
ttatcatcta ctactttcta gtcacttgtc 
ttcaaaaata tttacgatgt taagttcaaa 
attttacaag gcatttggtt actcgttttg 
taaaacatca aaaataaata ccctctccgt 
gttgttttga aaaatatttt ttatattttc 
gagtttaaaa aactaattgt atttattaaa 
aattaatcac aaaataatat atttaaattc 
acctacattt taaaaatcga ttgagtgtat 
aaagagcgta gtaactgtta ttggttgtct 
gttaattttc attaaatggc cttttccttt 
ttaagtctga tttttctcat acaaaaggac 
atgtttttta aaagaaaaat gttaagaaca 
ttatccttaa acatattcca tactttgttt 
ttcttaaagt tagatatttt agaagaaaaa 
caatgcatgt tttattatct aattgcaaat 
ttattggctt aaaattatgg aagaaaaata 
tattaaaatt tgtgaaaaat ctagaatatg 
aaaaaaaact aaacgtagat ctgacaattt 
atctttgacg aaaatgtctc gtaaaggcct 
gegtgcggga gtttgtggat gtggtggctg 
gattggcata atatttttga atttttacgt 
agatattaca acagtttaat aataaattat 
ataaaaacat gctattgtga taaaattaat 
gttgtattta ttttttaata ttttaattag 
agatttatat ttaaaaaaat aaagaagatt 
atatatgtat gcatattaat tttaaaaagt 
caaaacaaat tatgataccg tttgtgaatt 
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14521 tgaattaata tatataattt atatattttt 
14581 tttaaaatgt ataaaaaata tttttttatc 
14641 agcttttgat tttaaaaggt ttgaagcggt 
14701 gttttgaaac attgtcaacc gctatcaacc 
14761 gaaaccagtg aagccctaaa tagtgtaaaa 
14821 tagtatgtaa gtttcctata taaatttggt 
14881 cgatttccct tagtgtatac tcaatgtttt 
14941 acctaattca tgtccaaaca gataagcata 
15001 attgtttagt taggaaatat ttaacgtata 
15061 aggcactaaa aaccgatata gttaatttaa 
15121 aagacaagag atttaacgca ttgacgtccg 
15181 tcccactaca agaaaaaatg cttttattag 
15241 ggcggtttat gaagcgttgt atcatcggcc 
15301 tttttttgcc ctgctatcgt attgttatat 
15361 attctttaat agcgtttttt tttgttatta 
15421 tttaatttaa ttattaaaaa taaaaatata 
15481 aaatttcttt attatttaaa cctatttaca 
15541 aaaaaaataa atctaattag ataaaaaaat 
15601 aaagcttaaa cctaattact aattaactaa 
15661 gtcttcttca tcttccaatc ctcagccgcc 
15721 cccttcgtcg tcgcatcccg ccatgattct 
15781 ttgcagagag gaggcttact cgtcgtcgca 
15841 tccacgaggc ttgctcgccg tcttcccggt 
15901 gcagcagcag agttcagagt cgaaggcgcc 
15961 gacaaaggcg cattgctaaa tcaaactcaa 
16021 aagataaaag atatagcaga gaagactcca 
16081 ctgtttctcg tagagaagaa aagaaaaaaa 
16141 attgttgtcg ttagatcaaa actaatctta 
16201 aattgaccaa tagacaaact tctttatttt 
16261 gtgtttttta ataacaatat ttttattttt 
16321 ataactatag attgatttaa aaggttggag 
16381 tttaatgttt ataatataaa ttttgtggta 
16441 atttaatatt aaataatgaa aataatttgg 
16501 tgattgaaga tttaaggtct gtgtgaatat 
16561 tatataactt tgacaaagtg actacatttc 
16621 gatgcattat ataactttga aaagtgttga 
16681 aatagaagta ggaatatagg gtttgagggt 
16741 aaaacttgct tatagtttag agatttaaat 
16801 ttgagtttta ggggttgtgt tttaaaaaat 
16861 agatttagat ttttagaagc aaatgaaatg 
16921 agagaaaaac aattatattt ttgccttagt 
16981 atgtataatg catttataaa aagagtttgt 
17041 gtttccaaaa atacaaacgc tatctaaaac 
17101 aaaaaacgct atcctggtct gctattaaaa 
17161 tttcattttc cttttgccct tttggtggat 
17221 ttatggagtt tggtggaggt tcatgatgta 
17281 agtggatcca ccgggatggg tcagcattta 
17341 catttttcgt ttgacagtag catacatcaa 
17401 cattcaaaaa taacacatct atatatatct 
17461 attatatatt tttaaaaact aatattgatt 
17521 atttcattaa cttaattttt attaatagaa 
17581 ttctaattaa atgtataaaa atatcaattt 
17641 atgaaaacca caaattggaa ccatgccaat 
17701 tgcatcgaga tgaaggcgta tagtttcgat 
17761 tttattagtg atggacctaa ctcacaccca 
17821 atcacatata tattgcccaa ggatcacatc 
17881 ctctcgagat gtgggtggga aacggtccaa 
17941 acataccaca gcaagatggg ccatttttat 
18001 gggaaatcat ctgctgggct taaccatagg 
18061 tcacacccaa aagctagctt aatagtggag 
18121 gatcacatca ttaaacgatg tgggacattc 



atttatatta tttccatttt aattttaaat 
tatgcgcaac cgtaaacgct aactgcaacc 
ttgaaacgat ttatagcggt ttatatgatt 
gcaaaaattg cgtttgcagg tggtagcaca 
tatctaaaaa ttatacctta cttttatacc 
gtcatcacaa agtttcatta tggtccttat 
gcttgtaagc atcgggtgtg acgacttgga 
attattattt gtcattcgta tatgtatatt 
tcgttttagc taaaatcgtg atgaaagaaa 
gattaaatac ataaataata gaaatatcaa 
gtgtttttct ccggagtgga aagtcttttc 
cggacgaaaa acgctatcta acgatacgat 
gtcataatag atcagacata ttagatagcg 
attatagcgt tttctgattt gcaattaaat 
tttaatattt atttgtttat tttaaaatta 
attaaatttt agaattgatt tatatattta 
attaattggt ttacaaaaat tgaaaaaata 
actaaaccaa ataaaccaaa aacctaaacc 
cgtatgccgc cgcacaccac cacctacatc 
tcaccaccac cttcgtcgtc ttctttaacg 
tcttcttcgt tgtgggattt gatagtcaag 
tccagccacc ttcgtcgtca ttgtctcctc 
cgtcttctcc aagtatcttg agtgacaatc 
agaggcgaag tcgaaggtgc cagaggcaga 
catccgaact ttaaaaaatt agttgagaag 
aagggaagaa gaaagacaga gagagtagat 
aaatgaaaca aagaagagaa gtacaaatga 
gggtttgtat tgcaatccac gtgattcatg 
tattttttac ttttttttcg acaacctttt 
tgtttaaggt tcgtgtacgt ataataagta 
tttgatttat gatttagagt tttattacag 
tggggtttag ggtttatagt ttatggtaag 
tgtttaaatt ttcttctaat gcatatagtt 
ctaccgaatt tatatttata aaatgaatga 
atttttagtg aaaatttata aatgcatttt 
taaagtttgg atctaatttt tgaagcttat 
taaatttttg ttttaagagt ttagggtttg 
ttaagatctg atattcaaat aagaacaatt 
tgggttttag ggtttgaaat ttagggtata 
gagtaggagg gaaagtgaaa aaagaatgag 
tatacatagc gttttgaaat tgatagtgct 
agcaataata acggaccaac caaacatagc 
taacggatat gtcaaccata gcagaaagcg 
cgcatttttc ttgtagtgtc ctcttttcta 
cttggccttc ccgacgatat gcatggtgat 
gatatagccg tggatcagca agttgtgtga 
ggttgttgtg gctgtaccgt caaacgtttt 
gacagtaccg gtactaagtt ttatcgggcc 
ttttagtaca caatttagtt tattgtaata 
aaattgactt ttatttgttt tagtatatat 
aatttattag tttgatatgt tttgatattt 
catataaaaa atttgcaact taaaaaaaaa 
atttcatggg tttttagtca agcctggcac 
ttcgtatctc ctctatcgac cgagacgtgg 
aaagctagct caagagtgga ggattgtcca 
attaaacgat gtgggacatt ctaatagccc 
cacggaaaca gcccaagccc aacatctcgg 
aggctacata agtagacaac tattttatat 
ctctgatacc atattagtga tgggcctaac 
gattgcccaa tcacatgtat attgcccaag 
taatatggtt caagattgaa attagtgggg 
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18181 cgtgttaggg cgtgaagagc aggctcggca 
18241 tgtgaatgac gaaggctttg tttccttgtt 
18301 aatggtgtta acttgattgg cttgtgagtt 
18361 gagtcgtcta taactcaacc ggaaattgga 
18421 gttccgtcaa caacgtgatg gtggctttag 
18481 gacgtgtgaa atttcaattt ctcttggtca 
18541 cattggagat atataatttg cagccgacaa 
18601 ttctgttatg aggtttgtgc tagagcgttt 
18661 ggatgtctgg ccaatcaagt ggaatcatgt 
18721 ttgctgacat ttataaggag aaaagtctta 
18781 aatatatatg cacctcttgg tgttgtaatt 
18841 ataccatctt cttcgggcta caaatggagc 
18901 atatatcttt ataaccttat tttagaggta 
18961 tgaatcatgg aggtctctca tctcactgtg 
19021 atggatctga gtgtgtgtat acatccacta 
19081 gtgtatgttt aaacattgat atatccaaca 
19141 acatatggaa aatcaaatgg tgatagtata 
19201 cacatcagtt ttgtgttcaa tttcttttaa 
19261 gactttgaat tttgctataa gttgaatttt 
19321 tagacgatcg atctactcca aaacattagt 
19381 tatgataacc aatcttttat ataacactaa 
19441 cgacatagta tataaaaaaa aaatacacgt 
19501 ctcaatttta tgaagttgcg atcagttaag 
19561 atatatatat atatatatat atattgttat 
19621 ttaatacttt ttggaaacat aaataacatg 
19681 gaagattatg tattttcttt tctacacatt 
19741 ttctttttat atctcttttt atatttattt 
19801 agcagattaa gtacttcttt ttaagttttt 
19861 aattaaactg taattaatta attaatttaa 
19921 ttggtgataa aaacaaaaac taattttttt 
19981 ataatattat cttatttaat tttgtcaaat 
20041 aaaacaaaat atcataaaac taataataat 
20101 aattagtata atgttttatt ttaaaataaa 
20161 taatacactt atatcaaatt gcatcaaatt 
20221 aaaaaaatta ttaacaaaaa tctatattat 
20281 aaacataaaa atatatatga ttttttataa 
20341 aaataaagaa acgtagaatg tactagaaat 
20401 ttttatagta tgacatattg taccaaacgt 
20461 attaataaaa aataataatt atttaattag 
20521 aattatttat tttccaagct ttgttgtgtt 
20581 tatttaagtt tgattttttt tttttgaaca 
20641 acagagttag agagacaaga tagtgcagcc 
20701 ttacgagaaa caaactcaaa cgaaacagaa 
20761 aaaagaactc cttgtgcttc aacaatcttc 
20821 ttggaatctg atcgaatgac cagcttacgc 
20881 gctttaaccg caagagcttc tgcaattaga 
20941 gagcgatgga aggtcttgtt gctcatatct 
21001 gtgccagcat tccacgctgc atctacaaaa 
21061 tgacgagcct agagaaagtt atctgatggt 
21121 tgggcggttt gccaaatcct tgcttccttc 
21181 ccctctcggt tctcaaaaat caacatgttt 
21241 ggatagagat cagtttcacc aaggcccagt 
21301 tgcaaaagag cttccgggga ggtgatgatc 
21361 agatcccaaa cacgacgggc aaatgggcat 
21421 aatccacatc tcttacaacg cccttccact 
21481 actggcaaat ctttactttt gatcttccat 
21541 atttgccata tacagaggtt ccagttgaag 
21601 cagtatgggt agcccgtctt ggttgagtac 
21661 tcatccttca tgtggaactc gctaggaact 
21721 aagtgtagtc taatcgcagg cagatcccat 
21781 actttcagct ctttgttctg ctctgtaggt 



cgtgggtggc tccctatctt caacctgcct 
gacaatacat aaccagagtt taaggaggct 
acgttctgtt gagcttactt tagtcgtgga 
aaaacaacac ttcaacagtg tgagatcgaa 
gaatagttgc actcttgttc tcagatgcag 
atgagggagc ttaaaccttg gtaaggctta 
aattgtttta tccaatagat ttgtctcttc 
gttgaagcct caatctcaac ccaactttca 
tgcttttggc tacatattcc aatataatgg 
gtccctggct aaccgagacc attgaatgag 
ggaggtcatt tttagcaatc tgatcattcg 
ttttttgttt tgtaatagtg gaaggtttag 
aaattaggta gtttaagagg tattttgctt 
gttaaaaata cagcttgtac tctaggttgt 
aatccttatt ctgaatttgt ccaacattat 
cgagaaattt tagaaataaa caaatggaac 
gtgataattt tttggtgttt gatatgtatt 
aaactaaatt atcgttgttt ggtaagtttg 
ggtataagtg attaacaagg tggatagtaa 
cggaaaacaa ttcaatcttg gattagacag 
atatgttttt ctgagattaa ctagattaac 
atgagtaatt tgtcccaatc ttgtttgttg 
gaaggccatt tatataattt agatatatat 
aaaatttata ttattaaaaa tgaagtaact 
ataaataaga attgtataga aacatgtata 
tagccattgc attataataa attaagtact 
ttatttacat attctttttg tatctttttt 
ttttattttt gtatttcaat ttaatttatt 
attaattaat ttcaatgctt attgtttttt 
agtatatatt atataaaaaa agtgttaaat 
ataaaaattc cgagaattac aattttcaag 
tcatagaaaa ctaatgcaaa aaaataaaaa 
ttaacaaaaa ataacaaatt aatatatgaa 
ataattttaa atataatatt atgcaaaaat 
aaaataatat attttactct agatgtaaca 
aaatagcgat ttattaaatt agttgtaatt 
aaacgtgggg gtttgttgga tccatctaaa 
tggacatata tgttctgtaa ctttgttgac 
ttgttgttgc ttcttttaga tcatttgatt 
catttgcata atatttttat ctactattct 
aaccttcgtt acttcaaata ctcaaactaa 
ttagccacat catctgcctc ctcattgtac 
ttaaaaagag tacaaaggtg atcaatatca 
tctttggtgt taatggccat gataagggac 
aaacccagag accgtgccgc ttttaagcca 
gcagagccga cgatgctgcg gtttgatgaa 
ttgaagatgc acccaaatcc tccaccactg 
cat t gage ct ccacatccaa gttagaaacc 
gtagataggt tctgtttcgc tttctcttgt 
agtgctaagg ataccagttc ttgetctgat 
ctccctatcc acaagtacca gaagatccaa 
ggtggcagat taactatgcg tegegaggtt 
gctgggtctg gtttgaacat gacaggcacc 
tgtagaaaaa catgacgctc tgtttctatc 
tgaatccctc tgtgtaaaag atttgeacca 
aggaaatgtt teagcttegg ggaagttcta 
tcagtcagct cttcaccacg atttgtcttc 
tctccagaac gtgtatggag ccagcagagt 
agtttcaaaa tgtgttcctc atactgaggt 
tcctttgtaa tgccattcaa aaggtccgat 
ggacccattg gacacattgg agtctttggg 
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21841 gagagccaag gatcagacca ggcgttgatg 
21901 ccttggcgta acacctctct gcccgctata 
21961 ggagctggag ttgagaggaa tggggtggag 
22021 agcaaggact ctggaaactt cagtattcgc 
22081 aacttctcaa tctcccgaca tcccaaaccg 
22141 caagcaaccc aagacatttt acgatgatct 
22201 acagtttgta tctgtttgca gagtgactta 
22261 ggcatagcag agagaataga cttcaagagg 
22321 gaggtccagc tatgcgactt ttgcctgatc 
22381 ttctttctgt tgaagtgctc tgggaggcca 
22441 atgttaagct ccgctttgac cctgcgtctt 
22501 atagcagact tcagacaatt gatgctttgg 
22561 ttctttagtt ctttgcaaga tatcgggtca 
22621 aagaggagat gatttagagg tggtgaattg 
22681 tgttcaaacg ccttagagca gagtcccgat 
22741 ggagacagcg ggtcgccttg tcggatccca 
22801 ccattgatga ggtatgagta cgaaaccgag 
22861 gtagggtgga agccaaatcg gaccatgaca 
22921 taggccttac tcatgtccgt tttgacggcc 
22981 cagaggtaat gaaggatctc gtgagttatc 
23041 aggctgattg gtgtggagag atcaatgaag 
23101 ccttcgcgat gaccttatag tgtgtagagc 
23161 tcttgggcgt tttgattttt ggtattaacc 
23221 gttgacccga gataaagaaa tctctgacct 
23281 aaaatccctg atagaaactg gccgaaaaac 
23341 tagcgaacac tgctcgcttg atctccaaat 
23401 tttcagggga gattctgggt gagatagcat 
23461 ccgatgtgaa cagggtatca tagtactctt 
23521 acacagcagt accatcttgt ttctcaagaa 
23581 tagtaatggc atgaaagtag ctggaattat 
23641 tacgttgacg ccaatacttc tcctcctcct 
23701 tgccaatgat atctgtatca ggagtactgc 
23761 ggctctggat agactccttg ctattcagat 
23821 ggcgtactcg acaaagcctc gaccatttaa 
23881 tcttttagaa catgttcagc tgagttttgt 
23941 ttttactttc taaaagagtg ttaattgttc 
24001 aacatgttta tttaatcaaa tcgcaagttc 
24061 ttctgaagca acttaaaaca ataatatttg 
24121 aaactagatt ttttggtgtt agtgttgaaa 
24181 tgcagatata gcaaatttca tcagtaagag 
24241 atcattttta gagaaaattt tctacgaaga 
24301 ctctatagat ttagtgttta gatttgagga 
24361 gtttagagat ttttttaaaa atataataaa 
24421 tgaatttcca aaagataaaa aatttcgaat 
24481 tttgtcaact attttgtttg tttttgttta 
24541 gtaaggatat aagagttttt ggctttttaa 
24601 tagatttttt aataaagaaa cctattttat 
24661 aggagagtat acaacaaagc acaactctta 
24721 atccttggac gacccctcac tgaatcgact 
24781 gaatctacat ggttatatat ccaaaaataa 
24841 caatcaatat aaacctccaa gcataagctc 
24901 tcttttaccc tcactataat agtggagtag 
24961 gtattgaatg ttgaccaaaa aaaaaaaagt 
25021 aaaaaagttt tatgtaaaag aagagtatat 
25081 cggtgaacag attttattta ttccttttta 
25141 gttgaccaaa aaaaaaaatc cacttatcat 
25201 ttgcatgtat atgttcaacg actgattaaa 
25261 cggaagtact gtatgtccca ccataccaat 
25321 accacagaga tgttcttaat gattcaagtt 
25381 ttagcaacaa tactatactc tatatttaag 
25441 cactattctt tttttttttg tgggcaaaat 



tccgatccgt taccaataac ccatcctagg 
actccacgcc aaccatgaga agccgattta 
tgacagtact tccctgccag tgtttgagcc 
caggccaact ttgctaacag agcatcatta 
ccagctctct taggtagagt aagagtactc 
ggtttatcgt cccaccaaaa gcgagtgagc 
ggcaacttga agcacgtcat cgcataggtg 
accaacttcc ctgcgccatt gaggtaccta 
cgatcagtaa tgctagcaaa tatgtctctc 
aggtatttac ctataccccc ctcacccaca 
gtttcaatgg gtgttttggc agagaaggtt 
ccagaagcaa. cctcatattt ctcaaagatg 
ctcttgcaaa agaacatcgt atcatcggcg 
cgcgctacct tgatgccaca cagctttcct 
aagacctcag agcacaaaat aaataaatag 
cgcgacggta caacacaacc gtgagcgccc 
ctaacacatt ccataatcca atgcacccat 
tctctcaaga aactccactc cagtcgatca 
atggtgcaat gcttctttgc tgatgaggta 
agaacattga aatagaacga ccagccacaa 
ggaggattgg tttcagcctt ctgcacatga 
agagtgctat gggcctgtaa tccgttactg 
ggacgtgagt ctcgttaaac cgacgatgaa 
cccgacagac ttcttctcct atgatgtccc 
cgtcgggtcc cggagctttg tctctatgaa 
catctggtat acttgtgagt gttgcattca 
cctctatgat ctgagatttc ccttgagacg 
caatagaggc tgctatctgt tcttctgcaa 
cagagaattt attggcagcc ttgcggcttc 
ggtctccgtg ttgcagccag aggattctac 
cgtaggcttt gtcaagctcg tttgtcaaca 
ttgatagcgc ttcctccaac gccttttgga 
tttgttctct agaccatttg ataatagcaa 
gtttgattta ttttcatgaa accactgatt 
agcttttcta agattcacac ctctttcttt 
atatgtatct actttttcac gactcaatgt 
aatattacac aaaaagccaa tgtaataaat 
ttttggatat gataatctaa ctagcttgga 
acttcatatg cgatttgatg acgatgtttt 
atcattctaa tattggcaag tcaatcatgc 
ccctaaattt gttattatca taaatatagt 
gtgaagtttc ttttttagga aaggagtagg 
aaaattttaa aatagtttca aatgaatttt 
tcgaaaacat ataattcgaa aactataatt 
aataattatt tatatctata tatctattaa 
taaaaccttt ttaattcatt tttctccttg 
aattatttga gacaactgcc ccttacaaaa 
cgagtcaaaa ctgaaagcta cttggtaaat 
ttgtctccat ttgatgcatt ttcctattat 
tagttgaaac aatatgtttg tgacagatct 
gcacgattgg tggcgtaaat tctgacggac 
tattatatat aggaacacaa ctcattgcaa 
cttctaataa tttatgcaaa ctttgattaa 
ttaaattact cttttacagt ttttttctca 
taaaaattac gatataaatc tacttgtcat 
tagatcgtaa aaacgctcta gcgttgcaac 
atacaattca tagatgacat atacggacta 
agctagccta acaggagcct agatcaccgt 
aaatcattta acgcgctatt acttcaatga 
aatcacccaa cgaaatttga taatttcaaa 
ttccaacact'attctagata gactttttgc 
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25501 gattggaatc aaaagctagc gtgtatatat 
25561 aatcatttct aaaaacaatt tgtactcgta 
25621 tagtgatttg ccattatgta taaaataaaa 
25681 agaatacaat tgtagatatc tgaatggtag 
25741 ggatatatcc tgatcagcac tagaattatt 
25801 tatatgaaat tgaacttata aaaaaatgtt 
25861 ttattatgct attattttaa tttatatttt 
25921 aaagacctgt ctttgaatcc aagtctctct 
25981 ctgctaattc tccaacttac ttttttcccc 
26041 atattagcta aatcctctaa tctctaccat 
26101 tcaaacatct ctgtttcttc agagtagtac 
26161 aacttgagag tgaatctcca gaaagaaaga 
26221 ttctacggct gtctcttctt cttggcctct 
26281 acgaaggtat gtgaagctta tttgcttaat 
26341 ttttctccaa ccttagtcta gacatctcaa 
26401 tattttcccg agaaaatcag tagaatctgt 
26461 tcatctctct aactctctgt gtacagttga 
26521 tgaagaccct ttgctcgtta tgtctaactg 
26581 gtccggcgtt acttgctctc cctctaaaga 
26641 tctgcaacaa catcacttct tttatgagtg 
26701 gtcgtgcaga aacatatctg cttcatctat 
26761 attaaccttc cttcaggaac tgtacgcttt 
26821 tatagtttga tgtttgtgat tttgacaatc 
26881 gaaccttcta ctcgggacaa taccaaagga 
26941 ggacttagga aacaatcatc tcatgggtcc 
27001 cattatcatc atgtaataaa aagactcaac 
27061 ataccttttc ttcataatga cgcttttgtt 
27121 gtttaacggg aaagctgcct tcagagattg 
27181 tcgacaggaa taggcttcag ggaagtcttc 
27241 aattgtgagt ttagcaaaat agtatgaaac 
27301 gggattttgg ttttactcaa ctctgttgca 
27361 ctggtctgtg caagtcttct cagttgaaag 
27421 gaaacattcc aaagtgtttg gagcactttc 
27481 ttcgagctct agacatctcc agttttgctt 
27541 ctaggacgag ctttcaaggg aactgcatgc 
27601 cccaatgcgg tgtgtaagat atagcttttt 
27661 gtttttacta aaatgtaata atccgtgggg 
27721 gaaagtccca gctccccacc aaagcaccag 
27781 acatcaagac ctaagtggct tcttgcgctt 
27841 ttctgcctgg ttgcactttt ctcagcagtt 
27901 attccttgga agaaatcttc aagcctaaag 
27961 tcatcatctt tccaaaaaaa tcttccaaca 
28021 gctttttgtt ctcttattac tccagattct 
28081 aggcaagagc ttgaggtggc gtgtgaagac 
28141 agtcaggtct ataaaggaac aatgaaaggt 
28201 gtcaaagaag aagattggac cggatacctc 
28261 gctcatgtta caaagaaaaa ctgagccttt 
28321 cagatttggc tagactgaac cacgagaacg 
28381 cctcaccgtt tgcaagaatg cttgtttttg 
28441 acctccactg taaaaatctc aacttcctca 
28501 tgttttagac tatagattaa cattgctatg 
28561 ctttggtatc atgggcaaga cggatgaaga 
28621 accttcatat ggaactggat cctccattta 
28 681 atcttacaga agattttact cccaaggtaa 
28741 aacaatacag ttaagcttat gtttctatga 
28801 cgattcttgc gagatcagag aagaatttga 
28861 tgctcccaaa cgcaatggag agtcgatatc 
28921 gtgttctttt gctggagatt gtgagtggaa 
28981 taattgaatg ggtaaagaag aagctattct 
29041 tcctttttct ttacaccccc cgttacatgt 
29101 cagaggcaat gacgagtttg gtggatccgg 



tcttgttatt tgtagtaaag ctctgaaaca 
aaaaatatga tacgatttta acagatacgg 
atagcaatta cgattgaaat ttcacgcaaa 
tacaaatcaa ttcctgattt ataaatattt 
ttttgtcaca taatttgtta ctattattat 
ataaattatt tcattgcaat atttatagat 
aaaagacatt ttctaccatc caatgaaaca 
ctctctctct caccccacac ttcaccatct 
ttttccaagt gagtttcctt tgataatagg 
ttcattctct ctctttgcta atcttggagc 
tccacgaaag ttgccgtctt taatgagtcg 
aaatgagatc gaactgttgt tctttacctc 
tcttcatctc ttgtgatgcc ttcgcctcta 
cccagatctc agattctttg gatcgaatcg 
tcgaccatga gaaaaaaaaa aagattacta 
caaagttaaa agcttttaat ttatacaaaa 
agctctaaga agattcaagg aagcaatcta 
gaacgacccc aattcacatc cttgcgcttg 
ccatgtcatc aagatgtaac actctcttta 
agatttggct tgttgatttt gatctttctt 
taaagggttt cttgcgcctg agttgtgtca 
ccatctctct ctctctctct tgacataata 
attgaatgct tctggtagag tcttacatgg 
gattggcaag ttgagtaatc tcaagatctt 
tatccccgct gaggtcggga gtttgtccag 
cttttacagc aaccaaaact gacggtttca 
tcacattttg cagaaacctt cagtccaatg 
gtaacttgaa gtaccttaaa gaacttcata 
tcgccgctgg atcatctggc tatccgtcaa 
acttctgtct gcaatggcaa atgttttaat 
ggttttcttc aaattcaagt gctaatactg 
tagctgactt ttcatacaac ttctttgtgg 
caaggtattc ttatcatcta ggttgtcttt 
gagagtttct cctttttttt atggtcaaaa 
aaaacaagga tcttaagcac agaccatctt 
tgttttcatt tcatggccca gctggtcttg 
gcaggtaacg cacagctggt caaaactcat 
tcagctgaaa ttgtggctaa gcatcataaa 
gagatagtca caggatcaat ggtcgctttg 
caccgctggc aaaacaggcc ttctctcatc 
gaaaagttcg cagtctacgt tggttagaaa 
ttccaattca aacaactcaa aataagtgtg 
gaaatgctca aggatgtctc gagattcaca 
ttcagcaaca tcattggttt atctgctgat 
ggacctgaga ttgctgtgat ctctctttgc 
gagctctatt tccagagaga ggtagcttct 
tctaagccat ttttgcactg tttcaggttg 
cagggaaact gctgggatac tgcaaagaga 
agtatgcatc aaacggaacg ctatacgagc 
tcagagcttt tgtgttgcat catactatac 
ataaatgttc tgatgcagat ggggaagcgg 
ttgtcatagg cattgcacgc ggtctcaagt 
caatctctga gctgagctca aacgcgatct 
aaagtaaagt gagaccagct ttgttatctg 
tgcagctggt tgattttgaa tgctggaaga 
gaaatattaa cagtgaaggt gcggtatgcg 
tggatgtgtc tggtaatata tatgcatttg 
gacctccttt ctgcaaagac agaggcttct 
ctctgaaaat accttaagct gtgagtaaac 
tgttgcaggc aaaggagtat cttgaaacac 
agctgaagca ttttaaccaa gaagagcttg 
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29161 aaacggtgtg tgaggtggcg aggcaatgct tgaacaggga tccgaacaac aacagcaaca 

29221 ataacaacaa gccatcagtg caggagttat gtgagacgtt ggagagtaga atcagtttgt 

29281 cgatttccgc agagcttaga tcatcatctt tggcttgggc tgagctcgcg cttgactctt 

29341 gatgaggtag aggagagtcc tgaatgatga tgaatggcta actggaaaca atgtaaatca 

29401 gtcgacacgt ttaaaagttg ttattgttct cttatcttga caagtgtgtt gagtttctta 

29461 cgtgttgctt tgattagtgt gtttacagtg tgtttgtaaa ggaagagata aacatagagt 

29521 ttgtaaaaag attttttaaa tttggaaatt ctatgagagg ttaaagaatg tgtgtgtagt 

29581 gtgaaatagt tttggaactt gtatgatgct atctttatgt tatacatacg gtgtcttcca 

29641 tattcattca gacaaagagt ttgctggttg aattgaaatt tcataaacgt cggattcaac 

29701 ttaataaaaa agtcttttga ctattttaaa aagccgccaa tttcatggct ggtacgactt 

29761 atctgttctc acgacgttga gtgccatcaa agctcagacc gccaggtata atagattcca 

29821 tcttacctta aactggatat gtattagtaa atttcccgtt gaattttttt tttaaaatac 

29881 attgtttttc tagcacgcaa cggttgcata gtacgccgcc atatgccaaa tatgtaactt 

29941 gaaagaattt acttgtcttt ttcttgcaaa tggagaagaa aataagatga caaaagaaaa 

30001 acaagtaaaa ctacttcacg ctatagttta tatacagttc acatatattt tatattctct 

30061 tacgttcatt tttgaaatat tttaaataaa aatttgaatt ttactactcc atagtatagg 

30121 atgtcataac agtgacataa ttatttaatc aaccaatatt ttagtgaacc taataaaggc 

30181 atataccatt tacttattta gacactgaat ggtcaaaacc atactacgga atttagaatg 

30241 gtatttgtaa aagcactttt aatttaccaa ataacagcaa ggcaaacact taaattttcc 

30301 actaagtaat ggaatacaga tcagaacaca cagaaaacaa tagtaaatta ttttgttcag 
30361 ttacactgat tatttttaca tattactaaa atatgttata aatacagttc tcatagtctc 

30421 atactggtta ttttctttat ttctatttta ttttaaatat gaaaattata ttacctattc 

30481 ttactaatat ataaatgtta taaaattact atatattttt tttaaaaaaa tatttcttta 

30541 taatataata tactttggca atagaatttt gtcatataac cataattaat ttcactgaaa 
30601 gaattagata gaagagagta tttacggaaa gaacccgata gtttggatat ttatataatc 

30661 ggcgcatata ttttgtcatt taccaaccga ctcctttttc cctctcatct gatctatatt 

30721 taaaaagctt accccattcg aaagtataaa cgtaaaagga atcaaaattg taaacgacta 

30781 aacacctggt ctttcctccc ccaatggcca cgaagctatt gtcgctcaca tgcatacgca 

30841 aagagagatt cagcggccgt taccctcctg tgctgcggaa acatctaaac agcaggccac 

30901 gcggcggcgg agatggatca ccgtcggaga cggcggtgct taagatcgac gaggaggagg 
30961 cggtctccga tgcggttttc cgcgtcacgg ggatgacatg ctccgcgtgc gctggatcta 

31021 tcgagaaaga gatcaagcgt cttcccggga ttcacgaagc ggtcatcgac gctctcaaca 

31081 accgggctca aatccagttc taccctacct ctgtcgatgt aagttttttt cttttttttt 

31141 tttgattgga aattcaatat ctgcttcttt ttattgttat atcggaatcg atcaagttgc 

31201 gtatctcttg gttgatgcac ttcctgaacc tacatgataa cgtgttttta tttactgtgg 

31261 taaattagga taataggttt cgttatctat tttaatctgg attgttgtac tgattgggta 

31321 ggtggagacg attcgtgaga ctattgaaga tgctggattt gaagcatcac tgatagaaaa 

31381 cgaggcgaat gagaggtcca gacaagtatg caggataaga ataaacggta tgacttgtac 

31441 ctcatgttct tcatccattg aaagcgtact gcaatcactt tacggtgtac aaagagctca 

31501 tgttgcctta gcaattgaag aagctgaagt tcattatgat cccacgctcc tgagctacga 

31561 taaactactg gaagagatag ataatgctgg atttgaagcg attcttataa gcacaggcga 

31621 agatgtgagc aagattgatt tgaagatcga tggcgagttt actgatgaag caatgacgat 

31681 gattgaaaga tcgcttgaag cacttcctgg ggttcagagt gttgagatca gccatggaag 

31741 tgataagata tctgtgttgt acaaacctga tgtgacgggg ccgaggaact tcattcgtgt 

31801 gatagagtct actgtctttg gtcatagtgg tcacatcaag gcaacagtat tctccgaggg 

31861 aggggtgggc agagagtctc aaaagcaaga agagatcaag cagtactata agtcgtttct 

31921 ctggagtttg gtttttacgg taccagtgtt tttgacagcc atggtcttta tgtatatccc 

31981 tggaattaaa catttgctaa tgtttaaggt catcaatatg ctcaccgttg gagaaatcat 

32041 aaggtgggtt ttggctacac ctgtacagtt tatcatcggc tggagattct atgttggctc 

32101 ttacaaggct ttacgccgag gatcagctaa catggatgtt ctgattgctc tgggaacaaa 

32161 tgcagcttat ttctattcgt tatacacagt gttgagagct gcaacatctc ctgatttcaa 

32221 gggagtagat ttctttgaga ctagtgccat gctcatttcg tttatcatac taggaaagta 

32281 tctggagata atggcaaaag gcaaaacatc tcaagcgatc gcaaagctta tgaacttggc 

32341 accagacact gcgatattgt tgactgtgga cgaggaaggg aatgtgactg gtgaagaaga 

32401 gattgatggc cgattgatac agaagaacga cgtgatcaag atcgttcctg gtgctaaagt 

32461 agcttccgat ggttatgtta tatggggaca aagtcatgtg aatgaaagta tgataactgg 

32521 agaggcaagg ccagtggcaa agagaaaggg tgatactgtt ataggaggca cactgaacga 

32581 gaacggtgtt ctgcatatta aggtgacaag ggttggttca gagagtgctc ttgcacagat 

32641 tgttcgactt gttgaatccg cccagctagc caaagctcca gttcagaagt tggctgatcg 

32701 gatatccaag ttctttgttc ctctggtgag tagtattctt gtcggttgaa attcaagttt 

32761 agtcttttaa aggcacaact acttcttaaa cttacgaatt ataatattgc aggtaatttt 
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32821 cctatcgttc tcaacttggc ttgcctggtt 
32881 atcatggata ccttcttcaa tggatagctt 
32941 catggtcata gcttgtccat gtgctcttgg 
33001 aactggggtt ggtgcatccc aaggtgtgct 
33061 acacaaggtt gagcctcttg cattgtaact 
33121 ttttgatttt ttttcagatg atgaaaccta 
33181 attgtatttg acaagacagg aactctcacg 
33241 ctcctgaaaa acatggtact tcgagaattc 
33301 tcttgtaact atcaaacaca tatacaaact 
33361 gatgtcacta tctttttatc aggtaaacag 
33421 atatgcaaag aaattcagag atgacgaaga 
33481 tgtgtctatc actggaaccg gagtgagagc 
33541 aaacaagagc ctcatgtctg gttacaaagt 
33601 agcagaagct gaagagatgg cccagacagg 
33661 tggagttcta gctgtttcgg atcctgtaaa 
33721 aaaatccatg aatatcaaaa gcatcatggt 
33781 cattgctaga gaagtcggta tcgactctgt 
33841 agagaaaatc aaggaactac aggtatgtag 
33901 aagattgaaa atgatagaaa caatgattct 
33961 tggagacgga gtcaatgact cacctgctct 
34021 tgcaggaacc gacattgcaa tagaagcagc 
34081 agatgtgatc acagccattg atctatcgag 
34141 cgtatgggct ctcgggtata acctcatggg 
34201 gtctactcgg ttcaggttgc ctccgtggat 
34261 cagtgttgtg tgttgctctc tcttgttgaa 
34321 cctggcgatc cgtgaggttc aagtggagcg 
34381 acagattact tggttgtgtg atttgatgaa 
34441 aaatttcaca tgattggttc aatggttctc 
34501 caaaagatga tgctttatcg tattcttctt 
34561 aaactaagaa tgcatgccgt tcaaatctag 
34621 accttatatc atttgagaat aaaaaaatac 
34681 aatacataaa tctgacttgt gggttatgac 
34741 tttctttact cgtgtccttt ggatgcattt 
34801 tcaggcgcaa taaagctgaa ataataagca 
34861 gcattaacag cgcgtgagga cggataggcc 
34921 agctttctct cttttgtcct cctagcagct 
34981 aaattctact tctatggcga cgaaagaccc 
35041 gacaattgct taacttcaat caacactctt 
35101 tgctaaaagg aatgatcttt ggtttaggtt 
35161 aatggttagc ttcttcttgt ttttctcaag 
35221 gtattaccct cttttttttt gttgttttaa 
35281 atctcttctt cttgtttttc ttggaacagg 
35341 aatacaaaga caaagttctt ctaattgtca 
35401 tacacacaca cacctcttag ctataaactt 
35461 gtttctgatg gtctcagtgg gatgacgaac 
35521 aacaagtaca aagacaaagg taaaaaactg 
35581 tgtttcttcc cagctgatta gcttttttca 
35641 tctggagatt ctagcatttc cttgtaacca 
35701 ccaaattaca gagtttgtct gtacccgctt 
35761 aaagaacatg aattggttgc aagttgtatt 
35821 tagattgaag tgaacggaga gaatgcttcc 
35881 tggggaatct tcggggacga gattcagtgg 
35941 ggtcaagctg ttgaacgtta ctacccaact 
36001 ccgtctcttt ccgtctctga agtcctgatt 
36061 tatgttatgt cctcttgttt ttttctgcag 
36121 tgaatggtgg tgttatccga ttcactctgt 
36181 acttgtcatg taataaagca aaatcacatt 
36241 tttcacaaat ccaaaccggt tgcactcagt 
36301 agcattacct catcataata cacaaaaaat 
36361 atacttatta atgtgttggt ttctataagg 
36421 agacctctct actgatgaaa accaggtttc 



cttagctgga aaactgcatt ggtaccctga 
tgagctagct cttcagtttg gaatctctgt 
gctggctact ccaaccgctg ttatggttgg 
gataaagggt ggtcaagctc tagaaagagc 
tattatatta tatatgattg ttctgtacca 
acagatcgaa tgaatgtgta ggtaaattgc 
atggggaaac ccgttgttgt taaaaccaaa 
tatgaacttg tggctgcaac tgaggtaatc 
cgtggcatgc ctcaaaagtt tgatctctac 
cgagcatcca ttggcaaagg ccattgttga 
gaaccctacg tggcctgaag cccgtgattt 
gactgttaaa ggaagagaga ttatggtggg 
tactattaca gccgatgctg aggagttgct 
aattctcgtg tctatagaca atgagttaac 
accgagtgct cgagaagcca tctcaattct 
aacaggtgac aactggggaa ctgcaaactc 
tatcgcagaa gctaaacccg agcagaaagc 
tacctccaga gtaaatatct tttaaaatat 
gtaggctgcg ggtcatgttg tggcgatggt 
cgtggcagcg gatgtaggaa tggccattgg 
tgatattgtc ctgatgaaaa gcaacttgga 
gaaaacgttc tcaaggatcc gtctcaacta 
cataccgatc gctgcggggg tgcttttccc 
tgcaggtgct gcaatggctg cttcttctgt 
gaactacaaa cgtcctaaga ggcttgatag 
ggtttagaaa accaaactaa accgatccga 
ctgtatgatt tgttacgtta gcataataag 
aggatggttc ctgataatat ttacctagca 
ctagaatact ggaaaagaaa tgtcacaagc 
aaaaacgctc cacactgaca agggagagta 
ataaatctga tttgaccaaa aaaaaaaaaa 
ggcagatgct gagaaatggt ccgagatttg 
tatttgaaga tgatgcttta gttttacgca 
accgcaacgg cccatcatta aaagccactc 
agagcagaga ctgcattttt catgcctttt 
gactcttctc tctcatttct cgtgcttctc 
agaatctgtt tacgatataa gcatcgaggt 
taataataga aaaaaatgtt tgtgtgtttt 
tttttttagt gtgtaaaaaa gagatccatg 
aaagaaagac tattactttg aacttaagat 
agtgcaagtc tttatctaaa taatcgctat 
atgcaaatgg aaacagctta gaactcagtc 
atgtcgcttc caaatggtta agctttcttt 
ccattattgt aattttgatg ctaaacgttt 
tctaactaca ctgaattgaa tgagctttac 
gtttcaacca aaaatcactc aaagtctctt 
aagacttgat gttttgtttc tgctgcaagg 
atttggtgaa gaggaacctg gaactactga 
caaatctgag ttccccattt tcaacaaggt 
gatgttgaat ctttctccgt tgtgtgtttt 
cctctttata agttcttgaa aaaaggcaaa 
aactttgcca agtttcttgt tgacaagaat 
acttctcctc ttacactaga ggtaatgaaa 
cctgaacact ttttttactc tctttgtgct 
catgacatca agaagcttct gaacctctca 
tgataactct actttggtcg tgactcgtga 
tgactttggt tccaatttgt tctgtttacg 
gactcacggt taatcaagtt tagtaaaaca 
gttttagtaa atcaacttta gtaaaataaa 
gaggaagaac tgaataaata ctcacggtta 
ttggtgacga ccacaaagaa agagagagag 
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36481 agctttatgg gatagctaaa gttaaacatc 
36541 tagctagttg ctacattaac aatgttacaa 
36601 gatttctggt tttaacattc tcagataaaa 
36661 ttcacattct gcacgtatgc tgctgctgct 
36721 aaataggaag aaagattaaa tactgtctcg 
36781 cagtgagtag gtttcttccg ctagtagagt 
36841 cttcccctga accgggccat ccactcattc 
36901 gacccacctg tgcttcttct tcttcttctt 
36961 ccttgtgctc ttcctttggt ttattttctt 
37021 ctccgtatat aaaactacaa gccacaacct 
37081 ttctgttaag agacattaca aacaagaaac 
37141 agaagaggaa gaattgagaa acctgaacga 
37201 cgccaataac atgaccttca ggaccagaaa 
37261 tggatccacc ttcttcattc accaaataag 
37321 cctaacaaca acaacacaag gtgattgata 
37381 gagtttcttc agttaacaga atgaaacgat 
37441 tctgttgtag cgggttgacg tagagtgact 
37501 caaagagctc gaggtctttg ctgtgagaat 
37561 aaaaataaaa gaaaacagat acagaatctt 
37621 attgaaatga ttttttattt ttttctggag 
37681 catgaggtgc aaaagcaagt ccagctgaag 
37741 gaggtaaaag attgctctta aattaaaatt 
37801 ccattactta gctatacaca gattataaca 
37861 cagcttaata cctaacaagc aacgttctgt 
37921 actaatttgt ttcagcttaa gatttttttt 
37981 attagaccac acacttggat cattaaatta 
38041 ctgtcttaat ttggccacac aatcatctat 
38101 taaccataca tctaaactca agttcttcac 
38161 ctcaccaaga ttagctaaac gttgcttcct 
38221 tttaggagca tttggatcag tcactgcaga 
38281 agaagaacaa ggcacaggag aaagccctaa 
38341 cctcggccgt ccacgcttct tcttaaccaa 
38401 ctgcacagaa gcagaggaag ccatccccat 
38461 gtggtcggcc atggagaaat gaggacccgg 
38521 gttcggcata ggtctcattc ccggcggcgg 
38581 aggggagaga ttggtgaagg ctcctctttg 
38641 tctcccgtcc atgcatttaa atgccttaaa 
38701 ttttagatga aacattgaga ctttatgaag 
38761 accctaattc cccaaattag taaagagaac 
38821 acctataaag aaggaagcta aaaatcttga 
38881 aaaggaaatc tcaagagaag agaaaagaga 
38941 cgagatgata aaaacgcaaa tgtacatcag 
39001 agaacagaaa agagagaaga aagagagaga 
39061 aataattgtg aagagagaga atgaaatttt 
39121 aagaagcgag gaaggagaga gagtgacgga 
39181 attttaaaat tattttttat tcaaacaaaa 
39241 acttgtttta agaatagttt tattcaaaag 
39301 tttaatgtct ggaataatgc atatacctct 
39361 tattaatgcc tttcatgtat aaattatagt 
39421 aatggtatgc tcatctcagg ttaacaatat 
39481 gaaaaataaa taataattta caaatttgca 
39541 atctagtccg tttggtgaca agttgacgtg 
39601 tagaaaccaa gagttgaatt atgatcccat 
39661 ctaaaagaag actcatccac agtcatgact 
39721 caaaaaatta gaaactataa aaataaatac 
39781 tatatataga atgatataaa acatatgtat 
39841 aagtaataca gtattatact aattatctgg 
39901 agtggtcagt cagattttga atgaatagtg 
39961 ataaacttta aactcaaaag ataaacttca 
40021 caaatgctaa acatcaatca taaacataaa 
40081 taaattttgg ataaacccta aattcaaatt 



tccttaaata ataacctgaa gaaaggggtc 
ggaacacaga gatctgtctc aattctccga 
tctacaacaa caaagaagat atttacagca 
gctgctgctt caatagctaa gaaggtagta 
ttctgaatca tccgcgagtc aagtcaatat 
caagtgatct gcctgagact gagactgaac 
cctgacctga gaagttctgt ggcgtctcct 
cttcttgtgg attagtctcc atttcgctat 
gtttgatagt cttgttactg ttattaactc 
tcaaacaaca acaaaacatg ttttttttat 
caaactcata ataccacatg aaaccaataa 
ggctggctgc aataagcatt ccaatcccac 
gggagacact caaaccgcct gttcggcttt 
atccccctag acttagtatc tcaaaacgtc 
taaagaactt cattagtttc ctcttgtttt 
ttttgaaaaa cctcatatgt caaagaagac 
gaagaaactg ttccggtgcc tgacattata 
gacatgactt ttgaaacgat atcctgcgag 
aatatattaa atccataaat atattattag 
agcttacttc tcctgctcca acactgatga 
tgttcatcca ctcacctaat gtaatcaaca 
tatattgaag actaatcaca tttcaatgac 
cacatggatc acttaaaatt actttttcca 
aggaattaca acatatccaa atcaaaccaa 
cactaattac taatctaaac aaaagtagag 
tcagtaacag cagagctaaa ccaacatagt 
ataccttcta agcatcactg gactttcaca 
agtgtaacat tgaaaaaaaa cacttgagat 
tccagtacca ggaggtcgac ctctggctcg 
agagtcctta gctgctttac tactaccacc 
agagacttgt ccatcaggag gagcgtactt 
cggctgctcc atctgtggcg gctgcagcgg 
gtgaatgttg tgtccgaaat cagagtgtct 
gttgttagct tgaggatgat gaacgttagg 
cggcgcgtgg agaccactcg cgacctggga 
gaggtagtag tgaggaaatg acattgcttc 
ttacaagtct tttcaaatgg gttttgagcc 
aagagagtag gaactaccca gaaatagtaa 
aacaatggcg gatttttttt tttttgtgaa 
gctgttacta gtactgttag tctaggagag 
gagagatttg atttcgaaat ctagggtttc 
aggagaccag ggaagatgac agaagagaag 
ggtttataca gtagtttttg atttttttta 
aggccttgtc tgaaagttag aagactgaag 
ctgacgacca ctattttttt tcctttatat 
ctattaaacg gttttaataa tattaaattg 
cttttgtact ttccgttttt tatagttgaa 
cactaaaaat gtaatttagt cagatatata 
atgcttttta acttttttta ttctcatgaa 
ttacacaatt aaaataaaag taattggcat 
aaatagaact ttagtggaat tgaggggtag 
aacaccctta tcaatctgtt ggctgagaaa 
cagtataatc acaaattatc cacataaaag 
atactctatc aaacataatc ataaactcat 
caactaacat ataaaattat ctggataagt 
cacgatgcag tctaccctat tcggatatat 
ttatttttat ggttgttggt tatttaagtt 
aacttgatac ttacctttat attataaatc 
aatcttaaat ctaatagata aaccataaac 
tgataaacct tgaacctaaa agatcaagct 
aatagagtaa ccattaatcc tacaccttaa 
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40141 acagcaacat gcaaacttta tctttagagt 
40201 tataatacta agttaactat atggtcaaca 
40261 tttactaatt agatttttcg gtatcaataa 
40321 tcattattat caacaagtat ttacatcata 
40381 ggaatcataa aattatttaa gggaatgcta 
40441 gtttctattt caatcattaa atatatatat 
40501 gtatctcaag atttttccgg tcatcgtctt 
40561 tcggggtcct aatcgggttt cggttttatc 
40621 aaaccagccc tattcggatt atttgaaagt 
40681 gttcgggtcg gggttagtaa atcttcaaag 
40741 ttcgggtccc aatcgattat tcagtttaaa 
40801 aaacataagt aaaatcgatt ttttgggttt 
40861 aaaaacataa gtataatcga tttaaaaata 
40921 atcaaatgaa aatgaaatat agttattgat 
40981 aatgaaaacc gcgttctcat gaaatgtaaa 
41041 taaatattta aaaattcaac agccacattt 
41101 aacattcatg taatagataa gtattttata 
41161 catattagga attgagatca tgtttgatac 
41221 gttctatcgg atatccattt agtttcgggt 
41281 accatgaaac aagatccatt cggtatttat 
41341 tatcggatcg ggttcggttc gaattttcgg 
41401 accattctaa actccaagaa aatgttctat 
41461 attaatagct gtgctgactg ctgagtaaat 
41521 tctattcatg tctttccggc atgaacaaca 
41581 ttttagtcga aataattact ttaatgtatt 
41641 atctctaata tattatttca tttcttactc 
41701 atttattcca atgtattact ttatttttta 
41761 catttttatt taattaataa tttttagtaa 
41821 tccaattatt ttatgtttac aaaaatttca 
41881 attattaaaa tacaataatc ataaatatat 
41941 tgaacattag aatatatttt ccacaaatgg 
42001 gttttttatc tttgattttc ctaaatcagg 
42061 ttcggacaat acttcattga ttttagcggt 
42121 tttatttatt ttatattttt atgtttagtt 
42181 tttaaattat taaataattt cttatgaaaa 
42241 aatattgaag tattaaataa aaataaggaa 
42301 agactatttt gtaaataaat aaaatcaact 
42361 tatatagata gagtaacctt tgaaaattga 
42421 gatgtttaaa atagaaaatg gaataggatc 
42481 ttttcgtcaa acacctagta tgctcttatt 
42541 ttttattttg gttcttgatt aaaaagaggg 
42.601 gagaggagag ggaagcggac ctacttgaaa 
42661 tctctctcac atgtctgaat cactttgcta 
42721 ctttcctaac tttatttgtt ttcacttttt 
42781 aaaagtcttt ttttatttct ctcaatattt 
42841 actatatgag ttccacactt ccacctaaat 
42901 taatatccgg tggctaaaga gttccgacca 
42961 ttgt aetata tatttatata gtcttttttt 
43021 gttttgctaa ttttccatcc tctatttata 
43081 aattaatatg eggtccttaa tetttgeact 
43141 agttatggca aaattatggt gtcaaattac 
43201 ttgacaatga aataaaataa atttgacttt 
43261 taaacttagt aacttgtcca ttttataaac 
43321 atatttgagg ccgaacacta ctgaacttgc 
43381 ccataaacat acttaaatgt gtccatataa 
43441 tatcagaaag tatttctcag aaaaaaaaat 
43501 gatttactaa aagtggaaga agagtgttca 
43561 ccaaaggtac aacaaggctt tatttctaag 
43621 ataegtatat atacatagag agagagagat 
43681 acatgaegtc attaatagtg tgtctttctt 
43741 ttattttcct tctgatgtat tttgtgggga 



aatataaatt caaaacaata aatcatttta 
ttgetttaga cttgttaaaa acatctatat 
attaagctgt gtataccagt tatteggata 
aacatatagc cgaaaattac attataatct 
tataatcttt tcatgagata caaaaatgaa 
atatatatat atatatatat atatatatat 
caacacttga tttctactag gcatgggtat 
cattcagatt teggttttte gggtttatca 
tcggttcggg aceggttegg gttttategg 
aaceggtata atccaatgta ettteggaag 
aatatctgat ttgtacctac tttgtaacta 
caaataccta atttgaacat ttttgtaacc 
agaaatgaac atcaaccatg atcatttaaa 
aaaaagaaaa gcaaataaat gaaaacataa 
acattgttta atgaaaacaa aatctaaatc 
aaccatcaac cttcatgtaa tagaatcatc 
tgttcagtat atcttaatgt attttgaata 
aagatatttc tagggttttg aatgtttcgg 
teggtaegga taatacccat aacccgaaat 
gtcgggttcg gateggtteg gattcatttt 
atteggttta tttgcccagc cctaatttct 
attgttcttt ttgttcatgt caagagatct 
cgtatgtatc attaattgat tggagtttgt 
aaatattaca caatcttaat taagtaaaat 
aacaactagt ttttagtggc cagcaagcac 
taaaacggtg taaaccaaaa atagaattga 
ctttaaaaat gaatactctc aaaataattc 
taccctcact tataaaaatt atcaattaac 
taaaatataa tttatttaac ttaaatattt 
ataagatata tagacattct ggttcaataa 
ttaactaata cattttaaaa tgaataacaa 
gaaaaaatat gaaatcggtc atttttatcg 
tccataaatg atttaggaga ttatacttat 
ttggtatttt tattagttta tacatcttta 
taatattttt catgttattg tatattttta 
aatatgaaat aaaaatattt aaaatttaaa 
ctaaatagaa taatgaataa aattactcta 
gtgagattta agtatatttt atttaaaaaa 
aaagatgtcc cctgatacat tcgtgtgttt 
gcaagattta aaaacagaaa acaaagegae 
tgc cat gaga gagatgtctt gttgactaaa 
agagecaaga agagagagga tatctctctt 
tttctttctt tgetteggae agtgagcctc 
atattcgccc ttttcattta attttttatt 
cttttgaaaa tgattcaaga aaatatttat 
tgaaagcttg tatgatgeat ttattgeatt 
ataaaatcca atttgtcaac attttgtaat 
ttgttttttt atatagtctt tttagatttt 
tcctctaatt tctttttttt tccattttag 
taacttgttt tgtcaacaat ccctacagtc 
aeggcttcaa cgcttattta atttggcttt 
ccattctcac etttttgegt aactttcacc 
taccgatttt gatgaaaaga tegcataaat 
caaaaaaatt atatattctc acataaatgg 
catttttget agttcaagag agcaaaacta 
tatagaagac tgtgaatcac aataaatgta 
aagaggtttt cattttgagg atggtgtctg 
aaattgccgt agttaataat gtgtttcaca 
tatgtactat tataatagtg ggtctgtgaa 
gtaaggaaag tgatctcaac tgttgctttt 
taagagagat atgattctct tcacaattct 
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43801 
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43981 

44041 

44101 

44161 

44221 

44281 

44341 

44401 

44461 

44521 

44581 

44641 

44701 

44761 

44821 

44881 

44941 

45001 

45061 

45121 

45181 

45241 

45301 

45361 

45421 

45481 

45541 

45601 

45661 

45721 

45781 

45841 

45901 

45961 

46021 

46081 

46141 

46201 

46261 

46321 

46381 

46441 

46501 

46561 

46621 

46681 

46741 

46801 

46861 

46921 

46981 

47041 

47101 

47161 

47221 

47281 

47341 

47401 



tttgtggtcc 

ggtgttgctc 

atacattgaa 

tacaaataaa 

agtaacaaaa 

gattattacc 

tatatttggt 

aaattttata 

aattgacttg 

gtgtttattt 

tagctaccct 

tcactaccct 

taacatagat 

atctcattct 

actccacaga 

aatgtataga 

ggtaaagaaa 

gcgaagcaga 

tcgccacatt 

tgcaacatac 

tttagataaa 

gttcttcctc 

ccaaggcaag 

gcaattccaa 

acttttctag 

cctaagtgat 

ccaaactcat 

gcgcaggaag 

taacgagggc 

gaaaggggtt 

attcaatacc 

ataacaccat 

accaagggac 

cggaagaatc 

ttcttgcaag 

aagaatcctt 

cgaaaatcaa 

gaggcagaca 

aggaaacggc 

tgttctctat 

catcatcttt 

acatactgta 

gctctttcct 

tcctcttcga 

atgcatatta 

agcagctgct 

agctgcatca 

tcattatata 

ctatcacaaa 

caaaaataat 

aattcaacct 

ctatgcgtat 

taaaaagagt 

agaagggaac 

attgtatcat 

ttcagctcac 

ttctgtaaac 

tcacacttaa 

aactttgcta 

tgtttctgta 

acatgaacaa 



ggatatttct 

tagttttttt 

ctgaagtttg 

actgttgaaa 

atcttgtcac 

caaaaaaaca 

tcgactggtg 

gtctaaaagc 

agaaaaacta 

tggttctcga 

tttgatgcgt 

ccttgcccgt 

caatcaaaca 

ctttttttta 

acccacatga 

ggctacaaga 

caaacctgga 

cataagcctc 

gactacaaga 

ttcgagctcg 

cagtaaaggt 

tgcaatgacg 

agcacccgag 

agtggtagca 

ctcgttttgc 

actcttcagg 

caaccagatc 

ttgatattat 

tttcctagat 

tgtaaccaca 

ctattttgta 

cctagtttta 

agaccagatg 

gtgctatgct 

agcgggtgat 

ggctcttcaa 

aacattgaac 

aacctttaac 

agataaaaag 

gcctttcaga 

ttgatgaaca 

cgagttcttt 

cccaaaacct 

tttgtagtct 

catattctaa 

caatgaaatc 

aagatatgaa 

aaacaaattt 

tagtatttta 

tagagaaaac 

caacaagagg 

gcaagtatgc 

gtcaaggatt 

tcaggtcact 

caaagcgtga 

gaatgttaat 

aaaagaaacg 

agatacagaa 

catttatcac 

agcgtcacct 

gtgacatagt 



ggtccagtag 

ctgaggtttg 

tatatttatt 

cttaaaacta 

gacaacaaaa 

ggctagcaag 

cgtacattag 

ttttatgttc 

tttaaaattg 

gtgaaaaacc 

ttacggcttt 

ctatggtcaa 

aatggattct 

tacgtttcca 

aaagtagctt 

gagtacgcac 

gggatcgttt 

aggccgccag 

aaccatgaat 

taatgcctgc 

tgtgttgtgc 

ggcaaatttt 

tctgtcacca 

agcaatataa 

ccactcttgt 

taatattttc 

attgtggtgc 

tatcagaaca 

cttctggtat 

attacatctt 

cctcagggga 

aacttgccgt 

ttctggcgca 

tgccatcatc 

tatcacactg 

gttgaacacc 

actctataat 

acaggaagtt 

ggctcagaat 

gagggtaaaa 

aaagacttga 

aaggtcggac 

cgctgtagac 

ggaaatagaa 

aatcaaatat 

cagagacgat 

ccacagaaag 

ccatgcattt 

aatatcttga 

atttaccgtt 

gagcccttca 

attgcaacgc 

agaagtaaga 

gattagcttt 

aatccaagac 

gctacaaagg 

agtgacttca 

caaaagtaca 

gggctagttc 

gaagctccaa 

gatcattttt 



tttcaaattt 
aaatgtgaaa 
ctttgtgttc 
gtattccaac 
gactttacat 
atatttgatt 
atttaatcgt 
tccattccaa 
aatgaataaa 
agggaaggat 
tccactgcaa 
gtgacggtgt 
gagaaatggg 
tggcaggact 
tgatggtgac 
gaataaaagt 
tttggagagg 
tgaagtttaa 
cttgagttgt 
atcaaatcaa 
ttgttaacgc 
caccctcaga 
cctggagaca 
tgagaaaccg 
ccagtgtctt 
agtgtttgtc 
tctttattga 
taccgcttct 
ctgcaatttt 
gaacacaagt 
aggtgaacta 
ttgaagtttg 
gtccactctg 
tgttttcgca 
ccacctctcg 
aagcgattta 
cacatgttac 
caagcagaga 
tctgaagcca 
cggcaccaat 
gaaatactaa 
atctgagttt 
aaagttccgg 
agtcttggca 
tagtggacga 
cttgcagcat 
agcactcgga 
tacattcaac 
taattgctgg 
tgtgcttttt 
actatttcag 
aaaaataggt 
ccttgaatgc 
cgttggtcct 
aaagcatcaa 
ccatcgaatt 
gaatgattcg 
gccaaccttg 
tgatttcttt 
tatctattaa 
agaataatga 



ctactagctg 

caaacaacaa 

actcttcttt 

gatctttcaa 

gacataaaag 

tcgaaacatt 

tatattcaag 

tggaaacatt 

ttaaattgat 

atgaacctga 

atcaccaact 

ctttggctcc 

aagaagcgat 

taggtttatg 

ttaaactacc 

ttgtgccaat 

attctgccgg 

cacaataggt 

atggcttcag 

accataataa 

tcatatcttg 

gataagctcg 

aaaccaataa 

tgggtgtggg 

ttagctccac 

cctgtcaaga 

aataaggtat 

aagtgctgct 

aagacgagct 

atcaaaacca 

gagcctaaca 

ggaagttctt 

gttatcaagt 

gtgtccaaca 

caactaagca 

gactgagcaa 

agaaaacaga 

acacggagat 

agtttcagcc 

gttgacagac 

gcatgggctg 

cactggcaag 

cgatctcaac 

tgaaattata 

cgtacacaac 

aaacctacat* 

acatgaaacc 

acaaaatgaa 

agaccaataa 

cacgacaaga 

atactgtggt 

gggaacaagt 

ctaaggatgc 

tgccttcacg 

catggtactg 

catgaaccaa 

acaaaagtgc 

agagatataa 

agcaaatcca 

gggaaaagaa 

tgataatagt 



ctggtagttt 
ccaaatcatg 
ttttgcattt 
ttaacggttt 
tccataaact 
aaatatgtag 
ttaatctggt 
acaaaaaaag 
ttatatttca 
agacaccgtt 
gctgcaccgg 
attgatctgt 
caggtaaagt 
ccaataataa 
gaatgaagca 
gtttttggat 
aggacagcaa 
gtgataccat 
gcaaacgcag 
tacaaacagt 
ctaacctgaa 
aatacatctt 
ctctcgttaa 
ttggtcatgt 
aagtctcttg 
agcacataac 
ctctcactaa 
gaattttctg 
taaacgatca 
attaattaaa 
agatggtctt 
ccaaattaag 
ccataatctt 
aatcttcagt 
gtaatgtttc 
ccaggtcctg 
aattccacag 
gaggcagtag 
aacgaaatgg 
atcctacgaa 
ggttaagaaa 
aatactgctt 
aaacagcttc 
atggagtagg 
agcttccgag 
cgatatgaca 
taatgctatt 
agaatgcgaa 
gaggcaaatt 
tgccttcttc 
tagtgttaag 
taagagacag 
tccatcctgg 
aacatcagcc 
cctcaggata 
atcttctaca 
atgttgttat 
aaccactcaa 
ggatcccttc 
cgattagaac 
ctacctctga 
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47461 ctttagcttg aaaagttcat ccaactcaat 
47521 tctcgccctt ttagctaaca tctgccgacc 
47581 aagggcaaga aatgaaagcg aacctggaaa 
47641 taaactttga aaggagaaaa gaggcaaata 
47701 aaagaccttc gtggatatgt aggtataaga 
47761 gcaaggattg aagaaaatat atctcatatt 
47821 tttgactcaa acaccccata atagaaactg 
47881 gtctgtccga accatggttt accttcgcgg 
47941 tacttaggga gactggtgaa ctggaaactt 
48001 acaaggaaaa acgggtaaca ctaatcagaa 
48061 gtcagagtga actatcgtgg ggaagttttg 
48121 gaacaatcga ttttctcaga caacttccta 
48181 gaaatgctca attgataaca taatgactta 
48241 acctttcagg tacatagact aacctttaat 
48301 gggtcaactt tcagcaactc atcgatgaac 
48361 cctttgcctt cgacttttga aagacagtcg 
48421 atcaaacttt ttgcaacatc ccgtacctgt 
48481 ataatccacc caatgagata tatacaaaat 
48541 ttcatttctg atccacccca aagaaactgt 
48601 tcactggaaa agacatcctg gagaaccttt 
48661 cgtgcctggc aaacttgcat cccctcctgc 
48721 actgtaataa gattcaaggc caagcaacca 
48781 acatgtttgg acattaatat ataaagggaa 
48841 actgctccta gttatttaat actagacttt 
48901 cgatgctgcc gctcatataa agtccaatat 
48961 tgcaagtaag tataaaacta cctttttgtt 
49021 cggccgttgt gaatttttgg tttcttgagt 
49081 taccattcca tgcaactcgg caagggtata 
49141 acattcacaa aggtgcttcc agtgctgaat 
49201 gacaagaaaa tcttaagaaa aatctgccta 
49261 tctatctaca aaaaaggcta gccaagttga 
49321 tacctccaag catgcaaaaa cagatggcct 
49381 aaactgctgg cagatgatgc atgttgtatc 
49441 tcggagtatt gtcttatgaa agctaacatg 
49501 acaatatctg attgcagaaa gaaaaaacct 
49561 gtagaaacat aggagaggat ctcaaaatac 
49621 tttccttgct atatattctg agcagttctt 
49681 agctgtttcc ctataagtaa aagacaatta 
49741 tagcatacac aattccacta gtaaccaatg 
49801 ctcttttaat gtagctatct tcagtacccc 
49861 taagtaattc tagaacaaaa ttttctttcg 
49921 caaccaaaac caacaaacac atactatctc 
49981 tcatgagata tgactgcagg tttacggtac 
50041 taaggtagcc aatcggcagt ggcaaaattg 
50101 acagcacatg aaagtttatc ttaaactgca 
50161 tagattagag ttggcgtacc gaaattgaat 
50221 acaaagtttc caggctccta gcagtgagaa 
50281 aggaataaag ttattattaa tcataaactt 
50341 tgaactatat acttgtatac caaacctgta 
50401 gcaaaacagc cggactcaac atagtaacca 
50461 agagatcagg tagggctttt cgcatgacct 
50521 acttttcttt ctgaaagtga gcaaatattg 
50581 aggaaaagga actgatactg ggccgtgaat 
50641 cacacaaata aaacaaacct tttcaaaagc 
50701 acattttgct tctcccctga aatgtagaat 
50761 gagtgaaatg tttctttatg acaaaataag 
50821 agaactcaaa aataacaaaa gaagaagatg 
50881 gataattcac agagtaaaaa caatggtcct 
50941 ttccaagata tagccaaggc actgtaacac 
51001 tagatccttt caacttaggc atgttattga 
51061 aaacatttgc ctcaactgat tctggtcttc 



tccaggaggt tttgaagcgg caagatagca 
catgaagaaa ataatttatc aaattcaaga 
atcgaaacca ctagaatagg agtatgctcg 
catctagatt caatatggcg tcagtctctc 
aaaatacgtc ctagataccc ttcaaaatta 
taatccaata tgcaatattt atatactcct 
cgagctagat gaaaaaatga ctataaatat 
aagatatttt ctttgacaag atcttgtgtt 
cggaatgcaa tagctccatc tgggcgatct 
agcataactt gaatgatttt actggatttg 
tagtggcata ccgttgggct acttgacaga 
gcctcttcag caaagtccta caaagaaatg 
caacgctacc aatccaaaag gaaaactgga 
ttaagataac cagactgaaa gcaaggaacg 
tccaagtgaa ctttctctgt atcattgttg 
ccaacagctt cagcccatat ctttgctttg 
gtgacaaata gaaaaaaaaa gtttttttcc 
ctttccaaca gatactcaaa actcacgcgg- 
tctgcttcct ttaaaagagt ggcatataca 
gatgctcgac gaagccaatc ttcagctggt 
tagtaaaaga aaagacagca atattaataa 
aatatgatat atacaaaaat ctaggttgag 
ggctcctagt tttttttttt tttttttgga 
atcaaatcct taaatacgga tgaagaatgg 
ggaaaacaaa aaaaaatccg tattgttatt 
agaagcacct gaccgtttgg tgcctgaact 
tttgcagtca ccaaactttt caacttcttg 
ccgatatacc aggcgcagtt ttgtaggttc 
tcaaaattat aaaaacttac agaaaatcga 
cagtaacgca cctttagaac cctcaacagt 
tccccctcta agaagttaga tatagatatt 
gcagctgcag acgatagcag aaagatggag 
ctacaaatac cattataaat ggtccacacg 
aaagagtgga caagcttaac atcaaaacca 
cttcgatgcc cacagaatca gggcattcag 
cacttttcca aagctgctct ctcaaagttt 
tcttcaaata tattgatcct ttgccatcgc 
aaacaatcat ctacatttgt tgcttgaagc 
tcaaattact agtagatact cccgtgatgc 
actatcccat ttgatcaagg gtaccaatat 
taaatgcaaa agatatatag accatttaat 
ttcaccttag ctaccacaca gagaagctct 
agcctataca gctccgcacc ataaccacca 
acggcctctg cacaattcaa acctgatagg 
taatgacaat ttgatgggaa cactttaact 
ccagcatgaa aggattttgg aaacgtgatc 
caagaacata aattatgttg gaatactttt 
gtaaatgcac tagaaatgct aacaaccaaa 
atactgtgta cacagggact ttattttctt 
gttgaaagag caaatctggc tgagcatcaa 
acagaaaaat cacgcagcac aagcacctta 
tagtatgcca gaagaaatat attccaaaga 
aactagccaa agattagtta caatctggag 
actagcagca ctgcctggaa caccatacca 
caaaaagaaa acaattaaga gacttattat 
gtattaagca aggtgtcttt gcgacttcca 
acgattacag tacaggcaca taccagtggt 
caaaatgcca acaaaaagca gagaagagca 
cattgatgtt atgccgaatg gcctgaagca 
gattccaagg gctatggcaa tattcatccc 
tatcacctat tctaggaaaa ccactcccgt 
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51121 aaacagaagt atccaagtca ttaccataca taacttcaac gtcaccacct gacccttcta 
51181 ctatttccca aaacttcttc tcaatctgcg tcctagacac cggccctgaa ccaaaccact 
51241 tccttttggc gcgatcagca atccgcttga aatcttcaag tagcaaacat ttacccggca 
51301 caaagccaaa agtgtcttca tcagtgttta agcaatcaag gcaataccag ttcccaggcg 
51361 ggatatgctt caacggcggc gagagacagt atatatgcca acctttatta caactatcac 
51421 acaagagcat cacttcacca tgtttgtcac ttttgcactg ctcacatgcc tgatcaacct 
51481 cttcttcttc actttccacc ttatgatttt tctcgccatt atttcttttc cttctcttcg 
51541 agctagagaa ctcagaacac cgtcgattcc ttttgtgacc ctttgcagaa ggatccgcac 
51601 tcatcatccc gtgatacttc tcaaactcat gcaaatgctc tttatacaac tgacacaaaa 
51661 catgcttagc acacttggag atcttctcac cagagctcat aaactgataa acctcacccc 
51721 atttcttccc cttgacaacc ttatcataac ccccaaacct cttcaccgcg ttaaacacct 
51781 tacacagatc caaatcaccc ccttcaaaaa ccaccctctt cttcaccttc ttccccaaac 
51841 gctcctccac aaacctcccg tactccagct gaaacgtctt ggagttgcaa gaagcgggac 
51901 ggaactgcaa ccggtgaatc tcctgcgtct ttgtcggaaa cctgacagac tccaagtcca 
51961 aaccgaaagg cggtttccag cttttcggtg gaacaatctt gcagatcccg taagcctctg 
52021 cctcgggctt gatcttgtgt atgtaatcta gagggctttt gaattcgtct tcggtcgggt 
52081 aataaaccgg tccagacggg acattgaccg atcctctcag cttctgatct aaaacccttt 
52141 tctctactgc cttagctcta ccttttccca ttaccaccaa acacaatcca ctatcataac 
52201 ctatggaaca attcaatttt gatcacaaac cctagagaga ttaatccaag aagagtgagc 
52261 gtgaatccaa ttctagggtt ttagcgattt accttgtgag agagagggat agagagagac 
52321 aagtagtgca aggaatcttt ggtttgtgaa gaaagatttt gagaattttt aataaaatat 
52381 atatcgtttc tttataaagg aagattcact taattcaatt tccgaaattg ggttttctta 
52441 gtaacggata ttacggaacg ggggaggaga agggaagcca aacgcttctc agttcgtgtt 
52501 tatggaaaga aaaactttgt gccggttttg ttatcaagta ctgcatgccg tttatattga 
52561 aacaagctgt acaccgtgta gtttcgtttt actcgctata gtgggttttc tttcttggcc 
52621 ctgaatgttg aaggcggctt tgttaggccc agaagatgat taagctaatt agctaaacta 
52681 gttttttttc gacaataagc ttttttattt atcgtgacaa aaaaaaagct tttttattta 
52741 tttattatta ttatcatcat cttttccgac aataagctaa agggtacaaa ttcagagttc 
52801 ttcacatacg aaatcttgtt gaactatata agtgatctta aactacaaaa cttttttttg 
52861 aaaagtgact ttctaaaact atctgaattt cttaaatgaa cctttacatt ttatttacat 
52921 gtaagatttg atagaaaaca aaagtttcaa gaaaataagg aaaagttgta tgttttcaag 
52981 gtaagaagaa acaaataaaa cagtaaaaat attattgata ttgctcgcag cctcgcaggt 
53041 aaatgatgtg atggtcggtt aaaaggcgtt catactgttc aaaaaattat tggagcgtta 
53101 tgtcggtggc aattagaaac cttaaattta ccatactttt tgtatcaata aatacctttt 
53161 taattttatt aattaacacg ataattaata tcggttacga atttggttcg aacaaaatgg 
53221 tctaattaaa aaaaactaac aaaattcgct catccaccat actaccatat tccgctaaag 
53281 aaaatgatag actatcgatc ttcgttttct actattgtag attaccacga gaagatgaag 
53341 tttacctcgt ttcttcggtt tttggaacat catgtggcat catgtgtcat caccggagta 
53401 tctggtttat gatgatctcc caagcttatc gtctctactg attatgaagt caaatgtgtg 
53461 ctggtggttc taattaacac agtcaagtgt gtgttgtggt tctcattaac acagtggcaa 
53521 gtcttgagac cttgtatgga ggtgggatga tacgggtttg tgtaggtttc atgagtagat 
53581 ttgacagccg gtcgtcaatg gtcttttagt ggcttctcga tgtgattcct agcttcgtat 
53641 ctcctctatg ctgcttcctt ccccattaca tccgatagaa cgaaggtgac ggctttcttg 
53701 atcagagtgt cacttctagt ttttcatcgt tgatatgtct ctatcgacta cggctcaaat 
53761 tattgttctt cgattctcga aggcttcaag tgttgtagcg attcttgggt gcagccgtag 
53821 atcgtcatgg tgtcgctctt ggtcfcatgct cacatcccat cgcctcacga gttccctatt 
53881 atgtggtggt gtagcttaac ttttatatgc aggtttagag gcttaatatt aggaagtggc 
53941 gatgtttcaa tatgtttctt gcattgttca atgtggctat ggtgagtgta tagtttggac 
54001 cgatgcttct cttataataa ttgatctttg aatttcttct ttattgtcgt gtgcacttaa 
54061 aaagtggttt agattacaag attggataag atctttggga tccatctatc ccaaaaccat 
54121 catgagaagt ctggacatct gaagtaacaa tgtgaatttc atatttttgt cgttgaaggt 
54181 tgagaacagt gacggattct cgaagattat cagtccaatt taataaaaca cgtcgaataa 
54241 taagtcgaac aaagctaaag ctttcagggg atgagagtcc tttcctcgaa ttcagaagtg 
54301 aagggcttcc ccgcggttat gagaatttct ggaccgcttt cactttctgt tcagcattag 
54361 taaaagttta atttgagaat ttttgttcgg ttcaacgctt gtaatcaaaa actttatttc 
54421 tttttgagtc ttatttataa aaatctacta tattaattta gagtgctaaa atttatctac 
54481 catttagaag tagtcatttt aatttggaca ttctctagaa tagtaggtgt ttggctactt 
54541 aaatctgttt ggttccttga atatatcaac atgtctaaca acttaattta aaccaacata 
54601 aatacacatt ataaattgtt taactaatgg gctttggaaa attattaaaa ttgtgttgct 
54661 ttacaaatta aagaccatac gattatgtaa accaatttac cttaggcctt ataaatatta 
54721 ggttttaacc aactaacatt ttatttgaaa cttaaaaata aaagactata taatatatcc 
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54781 acaaacaata ttataataaa aacacattat 
54841 tcgttttctc agtaacaatt aaaagcattg 
54901 tcacacatat taacgtttat atacagtata 
54961 tatacactaa ttttttttaa taaatattat 
55021 ttcaattatt atatcgataa attaataacc 
55081 tgtagtccca aacattatta atttatagag 
55141 tgtgtatatt ttataaccaa aaacaatatt 
55201 ttttttataa tttgtatatt ctaatctcgg 
55261 tccgcccttg aaaagggcgg gtcagtatct 
55321 gattaatggt tggtagtagt gtgtgatagc 
55381 cccataagtg agttgacttg gtgaggagag 
55441 ctctcttaca tatacactaa cccccaaaac 
55501 actttctggc aaagttgtct gcttttgtgt 
55561 cttgagcttt ctctctcgag ctaaacaccc 
55621 ttttttcaca tttattctct cttctcttct 
55681 gatccgtttt ctcgattctg accccctttt 
55741 tacaagaact gtgttttgtc tgatctgtta 
55801 tcaacgcggt tttgatctgg tcaaggtctc 
55861 gaggggttta tggtttgact gaaacacttg 
55921 tgtacattct gttgcctttc agagcctcaa 
55981 aataaaaaaa cttatagctc tgatcattta 
56041 gatttgtttt ttttctttct ctctcaggtg 
56101 atgtgcctca gtttcgtgaa ttctctatag 
56161 ctacagagaa tatagtatca gagcatggtg 
56221 agttggataa tcagagacgt atcgctgtca 
56281 ctcgtcagtt cctggtaact aactatcttt 
56341 tatgcaatct tggttcctga atctctttgg 
56401 tgttggtcag ttaaggaact ataggatggc 
56461 agagagactt cttgttgctg agtttatgcc 
56521 ctgtgagaat catctctctc ttctctcttt 
56581 aagacagata cttttgatat gtttcagggg 
56641 taagagtagc tttacatatt gctcaagctt 
56701 tctaccatga ccttaatgct tatagagttc 
56761 tttgtttgtt tttgcttctt tccttgttga 
56821 tctctttgtt taggactcga atccaaggct 
56881 ggatggtaag agttatagta ccaacctggc 
56941 tatatatact taaccacttg catatcagtt 
57001 tctgacataa tagtttttgt gtgtgttgtg 
57061 atgtacagtt atggaactct gttgcttgat 
57121 catgtaagcc ttatggttta agatactaaa 
57181 ttctgagaag agtggttctt atctaatttg 
57241 ctggacctca taagggacag gaacattcaa 
57301 ttttcaagtg atgacgggac tgaactggta 
57361 cctcgagaac ggcctaaccc aaaatctcta 
57421 cttgaggtat tttcccacaa ccacttttat 
57481 gctgatgtgt tttgtgcatt ttaaaaaatc 
57541 gcataccaag cagtgcctca acaacgcctc 
57601 ctgacctaac tgccatacat gagattgttg 
57661 cagccacaga ggttcacttc cttaacacca 
57721 tgtctcattg caagtccttg gcaagactgt 
57781 ttattttaat ctgatcacca tttttggctt 
57841 cgttccagat gtggaccaac cagatgcagg 
57901 ttgctttcag gcataaagac tttgcaaatg 
57961 ttacttcttg attgtttgag gttttcaggc 
58021 tgttgttatt gacagtttat agagggtggg 
58081 agaagtctgt gtcacctgat gaatgagatg 
58141 gcccaagtga tatctcccgc ttggcatatc 
58201 gctctaggac aagagaacga agcacacgct 
58261 aaaagaaacg ctctatgatt atgataacaa 
58321 gctcgtaggc tttgcagaat atgatcaaaa 
58381 gagccactaa acggctctca tattgtgtgc 



ttttatggga ttaaagttta tttgattttt 
tctaaatatt ttacctataa attttgaatc 
aactctttga attaatactc tataaattaa 
aactttatag tctcaacatt gaatttttgg 
tctataaatt aatttttttt atagttttgg 
gtttcactgt acaagtttta atttaaaaaa 
attcaaaaac tatatatata aaaactattg 
taacaattag aaaagtttgt ttaaataatt 
agttttaatt taaaagaaaa gaaacagaaa 
aaaaatgagt tgttcttagt tttcttagga 
gagaggcctg ctgcctctgc tttgctgttc 
acacgaaaat ataaagtttt ttaattttta 
tcccacaaaa aggatagaaa gttttaggtt 
acacgagcgc ctcgattcct catttccaac 
ttcctctctg ctttactttt aatttccacc 
ttcgcatcgt tttgttaagt tgaagaacat 
atttcttttt cctaagctct gaaaagaaat 
gattgagata aaaatcagca gttttttagg 
attaaaatgg gttgtgaagt ttccaagcta 
gaatccaatc gtggactcac cagctctggt 
gtttttaccc tttttgttgt tttaatcgtg 
gtgatgatag gataggtggt gaagggaacg 
agacgctaag gaacgctacg tcagggtttt 
agaaagctcc caatgttgtc tacaaaggga 
agaggtttaa caggaaagct tggcctgatt 
gtttgcttaa taataaagct cttaccttta 
tttcttttga caataggagg aagctaaagc 
taatctgctt ggatgttgtt atgaaggtga 
taatgaaact ttggctaagc atcttttcca 
acatgaagct tgtcttgtgt ctcagtatgt 
agtcacaacc gatgaagtgg gcaatgagac 
tggagtactg tacaggcaaa gggcgtgcac 
tctttgatga tgtaagttaa aggttatatg 
tgtgtgcatt ataaaattca taatatggtt 
ttcttgcttt ggtctgatga aaaatagtag 
tttcactcct cctgagtatc tcagaacagg 
tgctttattt gtactgtggc taagtttttt 
tctgaaggtc gcgtgacacc agaaagtgtg 
cttctcagtg gaaaacacat tcctccaagc 
tcatccactc tgtaccaagt accatatcat 
aactttactt gtactgtgta ttcttaggcg 
atgttgatag actcatgttt ggagggtcaa 
cggttagctt ctagatgctt acagtatgag 
gtcactgcaa tgatccctct tcagaaggat 
tttatctttc acaagttctt atatctattg 
atgtaaagac tccttcacat caactaatgg 
tttcaccact cggagaagca tgcctaagaa 
aaaaacttgg ttataaagat gatgagggag 
accttttttt ttcttaatca ttctcgtaat 
aatccacagt gcattgagct acctagattc 
tgtttgttat ctgcaaaaat gttcagcttt 
actcgctgaa cttcaagaaa aagggtgatg 
ctgctgaatg ttattctcag gttagaacct 
aaaccatata cagtttctga tccaaaaaag 
acaatggttt caccaactgt ttatgcaagg 
cctcaagagg cgttgaatga tgcaatgcaa 
gcatcttatc ttcaagctgt agctctcaca 
gctcttaaag acggatcaat gctcgaaagc 
aaaaaaaaga ggaaaacaag agaaaaaaca 
cccaccacca tcttcctcat aagtagatag 
atcctcatca ctggtcagtt tcgtagttcc 



105 



WO 03/006622 



PCT/US02/22217 



58441 ctcttttttt tttcaaatac tggatgatgc attcatttgc aaattggact ctcttttttt 
58501 ttttgtgtgt tttctgttac tggcatgaac atttccctat tatacagaag aaaaagttgc 
58561 ctactttgat tatggactgg gaatggtgtg attgtggatt ttctttggtg ggttccgtgc 
58621 tttgtaatca atgttttgta ataacaaata acatgtatgt aaactttgta ttttggcttt 
58681 ttgatgatct tttgaatttt gcatatcaga gatgttcttg tttatgtgct aaacttttct 
58741 gtcttgagat ctaaatatct catacataaa aaagggccca atagggttca aattcagtac 
58801 gaatgctcta ccactgagct atggacacgt ggatgcatag tttgcatttt ttttttgtta 
58861 atgataaata cattaatctt actcgagtag catgtagaat agctgtgtgc aggagtccat 
58921 gctcttgctt acaagaaagg atactaggga aagtaaaaaa ctaagtggtc ttctgtagct 
58981 taaatctata aaaacacatc acacatgttc ttttgttatt ttatctctaa ttgaaaatat 
59041 atttggaaga atgtacagaa aggttatttt tttatttaat ttactggagt ttcagaggct 
59101 cctctggtca ggatattgtt tctctccttt tgcttttagt tgggggtact gattgcatct 
59161 gctttttttg gtactactct tcaggatcac tactctgaaa ccatcaattg gtatttgatt 
59221 gtgtctttta gcacaccctt tctatgattg gcttagaatg caagtgaaac caaagctttg 
59281 tctttttaaa aatatttttt tgagacaact tttgtccgtt gctttcatta ccttcacaac 
59341 tacttgattc ctatgtaact tggaccttta gatctgcaag gagcgtagct gaatcttgac 
59401 ttctctatca agatttgcga ttttaacgtc ttctctctgt aaaagtttaa tgaacagtga 
59461 tagctcttcc atcatttttg aaggagtgtc tggtaagatc aactcacttt tctttttatc 
59521 tgtttgccac attctcttct cttctcttat gtacaccatt atgatcaaag gctcctagtt 
59581 ttagtcatgg aactggcatt acaacaatac taatgccttt agaatcaaat tctctgcaga 
59641 agttttggcc ttgtctgatg gttatataga tttgttgatt cctacaaaag cagcaactat 
59701 ggcatcgcag tgttctgaaa acgttattcc tactctacgt gatttcttca actcccaaaa 
59761 gccaggggaa gaggatgagt ttatggttca tcaggtcctc aactcctctt ccaagaacat 
59821 tccaccgaga cctacaaaac taggaaacaa ggcaaaagag agacacggtc tcttcaatca 
59881 gggaagaata agaagcagaa gaaacatatc agatgctgaa aagttctctg tggagcaata 
59941 ctcttctagt ggtttctttg gggtacggtt caacacaaac ggaagacaac aacagcaaca 
60001 acgatcagct aagcctttag ggtcagatag aaacatggaa ccgagattgc agaagtcatt 
60061 ctccgctaga atgcaactcc cctttatgct atcttcaaag ccaagcaacc agtctacgaa 
60121 taattcaagc tggtttagcc gtatcaagaa aatgtctaat ccattttcaa atcgaaactc 
60181 tctgatacca aagtcaggag aaatcaaggt cagtggagta ggagaaacac tctcaagaaa 
60241 caagtcttct tcacctgttc atctacatgc ccatctcagt atccaacatg aacttgggat 
60301 gcctgttttc accttctctc tagaccaccc ggatgatgtg tatacggcca ggacgtggat 
60361 ggatgttaac gactctcggt tcgtctattc gtttcgttac attggtggta gaagcaacaa 
60421 gaacctcgga gaacagaggt cgaacgtttc aggtatagac tcttcactta taggacagat 
60481 gcaagtttca actcaagtct ccttagaggt agaagaagaa ccatacgaag atcctgttga 
60541 atcagccgtg tcagagttcg ttctttttga catcgcacga gcacggagaa gtggactcaa 
60601 gactgaacaa ctgtcaagac agaacagtgt atctgatggc ttaaagcatc tgcaaaggca 
60661 gaacagtttc agcagagggt tgactcgtag tttctcaaaa cattcagaga acagtgcatc 
60721 atcatcatct gatccttggc cagccacgga tttacatcca ggtctagaga ttgcagcagt 
60781 cgttattaaa gactcttatt cttctagcaa taatgagagt tttgagtaca tgaagaacag 
60841 taaactctct agacgagaga tgaaggttat agttccatca ggaaaccacg gtttgcctga 
60901 tgctgaaaac tcatgtccta caccgatact gcagagatgg agatcaggtg gtggatgtga 
60961 ttgtagcgga tgggacatgg gttgtcacct ttttgtctta gaagaacaac aactcatcaa 
61021 caatcaccac tgtctagaac ttttcattga ggtatgattt attttcttgt agtagccaag 
61081 ctaaatgaat atctctgtgg acagttccag gattagtcgg ctgggtctcg gcccagcgga 
61141 caccatgggt tatccaaaaa aatattttat tatggattat ccaaaaaaaa aaaaaaaaaa 
61201 tcttgtggtg gtgattcaac ttgtctttct tgatcttgag atatacaaaa aatgtggttc 
61261 agggagagaa ggagaccaga ccagcaatgg caatggcatt cataagagaa ggtcattacg 
61321 aggtaaactt ccatgcaagg ctctcagctt tacaagcatt ctcagtatgt gtggctgagc 
61381 tgcacagaac agaagtatca agaggagaaa gaagcgactc tttgtctagg tgcagttcgc 
61441 ttagagagct tattgacatg gcaactccgg taaacagaga tatcaacgaa gaagtcattt 
61501 cgtctttcat gcctaatgtc actttctctc caatttcaag ggtctaaaaa aaaacatatc 
61561 agaacgtaat atctccgtaa tatctccgta gtccttcctt tatgtgcata tatagctaca 
61621 ctcgacatat aataaagtta taaagccaga gtctgtgata caagtgcaga tgaaattagt 
61681 aaaatttaga actttagcag cacaaatgtt tgtacttact ttgtcctcta tgactcgaag 
61741 aagtcggttt gtttcaaatt ttatactctc acttgccacc cgtttcataa tatgtatcgt 
61801 tttcgaatct gttgtttttc tatcctctat atcataaaaa aaaagtagtc gtttacctaa 
61861 atttaaaagg aaattagtcc atgtcagttc agcttataga gtaattgttg tacatacttg 
61921 ctttggaaac aaatctgtaa taaaaattta gtact'tcttt atatgtgtga aaactcttct 
61981 aaaatgatca attctaattt gaaatagagg aattgtgaat ctcactttac tccagtgtct 
62041 ttatcttatg ctgctgagtt ttaattgact cggtctagtt ttctttttat ttttcgtttt 
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62101 aacatagttt tattaaattt taatattata 
62161 aatttactta ttatgtagca attatgtgac 
62221 gaccatggtg tgaaaacatt gtcggcaaac 
62281 taaaattaca atgcaaaagg taatcagatg 
62341 tttgattatg tgtggaggaa gtttctagca 
62401 ataattgcac acttgtgcat gggggtggga 
62461 ttgcacttgt caatacatag gaatctatct 
62521 acatttattc tgtaagtttt taaattaaat 
62581 ttaaatcatt aatgtttctt tctttatact 
62641 ttctttatat tactatcaat gtttccaaac 
62701 caaacaatac aataattaat cttagttatg 
62761 gtagaaacgt cataatttca taaattgtaa 
62821 attacaaatt atgaaattat tacaatttaa 
62881 caattcaatc ggttaatctc gggttttagt 
62941 taaattgaat tgtcagatct taaccggtat 
63001 gatttaaatg taaaaatatt ttaaatacac 
63061 gttattagtg aaatttttca tcgtaaaata 
63121 tagtttaaat attattagta atgagagttc 
63181 accaaaaaga gaataaatca aaagagaggc 
63241 gaaatagaga ttcacgccac gtcagatgtg 
63301 aggcccggca atatgttcat gcatctgtaa 
63361 ataaattaaa aaaatatatt aaatccctcc 
63421 cgacatcttg ttgaaacaaa tgatctcaat 
63481 gttttaaacc ttaggcagcc aattccttag 
63541 gactagtata ttttggtgct tacctttaaa 
63601 atataaattt tcatttatgt gcaggtccat 
63661 tttgtttaca aattcaccat gaaatttaca 
63721 ggcaactggt ctattgggtg ctgttattag 
63781 atcatctaaa atcttaactt ctcttaactc 
63841 gagtctatca acaagccggc cttccgtatt 
63901 caatcatatg tggctatagg ggctttctta 
63961 tttttttttt ttgatgtcct ccttttctgt 
64021 ttttgtttat gattgtgttt gatccaattt 
64081 tttttatggt ttgaagtaat gaaaggtttt 
64141 aactgtttgc aacagtataa acaaacaaat 
64201 ttcaaaacga tggtaaagaa acagcatata 
64261 agaaaattca aaagaaaaag atatatcaaa 
64321 gaactatgtt gtaggttaac ttaaacgatg 
64381 ctagataaga gaagaagaaa gcatgtcact 
64441 cagaataatt tgttgcaaac actattttct 
64501 tatttacttg ataataataa agcccacaat 
64561 aaaacactgg cttaaaatgc aaggaaggaa 
64621 tgacttcagt gacttgagta actgagatcc 
64681 ttactgatca agcatacata acttgagccc 
64741 cacttgtctc tgtatctggg atcttagtgg 
64801 tcacaaattc aatctttgag agatgagtag 
64861 gagaatgcct gataaagagt agaagaaaca 
64921 taaaatcaca atatattctt cttgatctaa 
64981 tctccaatac atatgacgct ttcgttgtga 
65041 atctgtgtcc acacaaaacc agctgtccga 
65101 tttgcttttc agatcataac agaaaaaatt 
65161 tgtaatacca ttgatcaatt gtagctgaga 
65221 tccttgtttc caaaacccat aaatccacca 
65281 aagagttact tctcctcaat attctataag 
65341 ctctccatcc ttgtggtcca tttttattat 
65401 gatagctaat ttaaagcaca ttttgaaaca 
65461 tcaatttaaa attctacaca atataatttc 
65521 ctcaattgat taaaaactta gatgggttca 
65581 agcagattag gctagtctat ctgcaggccc 
65641 tacgaaaatt cgaatactgg ttattcattc 
65701 gccatagttt cagttgaatg aaattaattg 



aagagtttac aagacacata accaattttt 
ttgtgaatat aattggaaaa aacagtggtt 
aaaaagacag attttaatta tttttgtgta 
caaacgaatt taaaaaccaa cgtttaaaga 
acaattatga tttccgggtg cataattatg 
ccttggcaaa acttagcaca attatgataa 
tattaaaaca gaaacattct attggaccta 
acacttttat actttatagt taaacttaca 
gctatctatg tttccaaaca atatatttat 
aatatatttt tatactacta tcaatgtttc 
ttatatctat cattttcttt ttaaaattgt 
aataatgaac tttaaaattt ggagtataag 
atcaaattaa attacatatc ggtcatccaa 
aattttttta atatgaatat tttaaaaacc 
aatcacaatc gggttgaatt taaaaacagt 
acttttaaaa attaccaaaa tatttgttaa 
ttccgcgctt ctaaagcgcg ggtcaagatc 
tattaagaaa aaaaaagtaa tcaaacctat 
tgtccacgtt gacaaaaaaa tcggtcaata 
tcacttaagt cgttgggctt cgaattccac 
agtattggtg ctattttgtt ctcagctatt 
agtagatata gactcagagt agtaagccat 
ttggttacca aagttcaaac tctgccatag 
ttatttactg ggtttcattt gggtcgagct 
cttttagtac gctatacata agtttcatat 
ttattaggtt gaatagttct tttgtaaggt 
agtgttcaaa cacatatgga atcttgagga 
actgccagca tggccatcct ttaaatggtt 
cttatcacat tggaaacttg aaaagcaaga 
gataggagtg tgatctcctt ggatctttct 
attggatgtc ctcctttttt tttttttttt 
ttaagaaagt cggtttgagt gtctttcttt 
ggtggttccc tatccttctt gttgtattgg 
tccagtgtct aagacaagtc ctgatgaaaa 
atctaaagta caagataaat gtatcacaag 
tatacatcga tactcaaaca aaaaaaaaga 
tagtagtcta catgaatgtt caccacacga 
aagaaatata tttatctttt tgaaaatttt 
ctaccggtac aaatacaatt ttcatcggtt 
agttaattgg ctgggtttct tatttgggct 
ataattgatc aggcatgttc cgcgagttta 
taaaggatca tatataagat ttaggttcat 
ttttttaccc cagttttggt gcgaagaaac 
ttgcacattt tagtactagc tagttcatat 
gcatgtgcta agcccatctc caacacaacc 
aggtatgaat agaagtggct gatatcgaga 
aggttaatta actcctttaa aacaaaatta 
aacacgcaaa agcatgcatt gtatcagcat 
tccatagaca tcaaatcgaa acgtgtgttt 
tttacctttg atttcaatct tgctcgtatc 
gtcattttga gttgcactta taacctttaa 
aggctacaat aacagagtct tcctgactat 
aactcttgct ctaagattgg tttggtcaag 
agctcccatt cgtgacattt gtaatcagct 
atggctttat atcccctttt ttcttaatat 
atgcattttg tgtgagtcta taaaagagtt 
atgatttcct ataacttttt ttttgtgcaa 
tacaattggc ccgataaacc cattggagaa 
atgattttct ataactattt taacataaga 
ataacattaa ttgtaaatcc agccaagcat 
taaggttgct agctttaatt atattagttc 
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65761 ggaagatata aactatacta taagtgtagt ctattgtttt gaatagagaa aatataaatt 
65821 tgttttaaca gagtccttct tcccccaaat aaatcattaa ataccttcac atgggaaatt 
65881 aactcacact gcaaatatta atagcttgct tttggtgtga agctacgtga tattgtagtc 
65941 ataacattta cttcgacatg gccatctttt tactagtatt ttataagttt cttcacaaga 
66001 ttttgtaatc ctgtgattaa cattaataca cattcaattt tcttcctgct tcgattaaga 
66061 ccctttactt gtacgttggt ggtctacata aagacgtata catatctgtt ttcattaaat 
66121 acaatcaaag actattcgtt ttcttctgtc aacaaatatt attcgtaatt cttccttctt 
66181 gataaatact tgtaccaaat gaaatttcgg agaaaattaa agaacactga caaagctagt 
66241 tacttaattt aggactcagt tttgttttat agcaaatgta ttacgtgttc gatttttgta 
66301 tatataatat taatattttt ttttggaaaa taaattttgc atactatgaa cacgtaacat 
66361 acatacaaat atatcactgc attttaacca tatttgataa acacatttaa aaacctacga 
66421 aaagttcaac aaaattggat agtagttaaa gcatttacaa aattaccaac tgtgaaccaa 
66481 gctgaggttt gtttagtgtt tgtgctatct gattttagca gtgtagattt aagaattcac 
66541 gttaataaat atagcacatt ataatcagta aaaatatttc aaaactgaaa acattaatta 
66601 aaaaaaagag actaaaaata atttcaaaat caaaaccacc gactaaagtt tgtagtatgt 
66661 tggtttgata ttagtattat catcatgcca ttatccaagt ataagaaaaa cataattata 
66721 aaatgtatgc tttaatatta aacgatttgt caacattgcc atttaaacag aaaaagtaga 
66781 attgtttttc ttcaaccggt agttaagcag aaatagttgt tggaagtaaa agactagtct 
66841 acgtagctgt catagttatt tcacctatcg taattaaact gagccttaat gaaaatcaat 
66901 ctctctctct ccctgaacag tggtctatgt gtatgtgtgt gtgtttgtgt ttataaaacg 
66961 gccttacgag aatcaacaag agtaaacttc tcaataaaac caaaatctct ttcttaaccc 
67021 ccaacgaaag aaccaatgag ccagaggaac aagaacggcc ctaagctcga gctgaagctt 
67081 aacctctcgc cacctccttc tcaagccaac ctgatgagtg tagttcgttc tccaagccga 
67141 tccaacacaa cttcgccaag ctcatgcgtt tcatctggaa cgtaccagga ggagatggag 
67201 acaacaatct caatggtact tgttggttgc cctcgttgcc ttatgtacgt tatgctctct 
67261 caagatgacc caaaatgtcc aaaatgcaaa agcaccgtcc tactcgattt cctcaaccaa 
67321 gacgcctccc ccgctacaac tgctcctgcc gccaccactg aacgcaataa gacatggtgg 
67381 aaattttttg tttaatccaa ccctttgtcc gaaatataaa cctataggct agttgatcca 
67441 accgtacttg tttctatata tttcttttgc ttcttctcgt ggttgtttgg tgttagagta 
67501 gtgctaatca cgctccacgt ttagaccaaa cattagcact aactttgctt ttaatgcgat 
67561 acgaaattat ttgagcaaat tatgttcagt atgatgaaac attcgcactg tgtgtctatg 
67621 tgcttctaaa cctttgcacc ttggagattt gaaagactaa tgtcctctgt ttcttaaaaa 
67681 aattacaaac ggtgtttctt gaattaataa actgtaaaaa gaaacattcg agattcggtt 
67741 ggtttgttga acataggaga tacagagaaa acttattaca acatttatcc cggttcgaat 
67801 ctcatttgaa actaaattct agcaatgtga aaagtttgtg tttgaattcg acccaaacat 
67861 tttataaagt agtctggtaa catgaaattg cgtgtgacat gacattaatc agaatgtatt 
67921 ttaaattcaa attaaaacaa tatgataaac agtccatttt ataatattaa atatattttt 
67981 ttatgtgtga tcggaccaac tgatagaata tattttttaa aaagatgctg aattttttag 
68041 gagaaatttc atatatacac tttttgagca aatgatgttc agtatgatga aacattcgcg 
68101 ctgtgtgtct atgtgcttct aaacctttgc accttggaga tttgaaagac taatgtcctc 
68161 tgtttcttaa aaaaattaca aacgttgttt cttgaattaa taaactgtaa aagaaacatt 
68221 cgagattcgg ttggtttgtt gaacatagaa gatacagaga aaatttatta caaacattta 
68281 tcccggttcg aatctcattt gaaactaaat tctagcaatg tgaaaagttt gtgtttgaat 
68341 tcgacccaga cattttataa agtagtccgg taacatgaaa ttatgtgtga aatgacatta 
68401 atcagaatgt atttcaaatt caaattaaaa caatatgata aacagtccat tttataatat 
68461 taaatatatt ttttacgtgt gatcggacca actgatagaa tgtatttttt aaaagttcat 
68521 taagatgctg aatttttagg agaatattaa attcatttag gtacctaaaa acatttttaa 
68581 aaatacctaa atcattaata ggcaaaaaat aaaaataacc tctcataatt tcaatctaat 
68641 tatatgtttc tttctattcc gataaaaaac tttgataaag acctagttga aatggttaga 
68701 gatgaaataa aactaccata atgtcaaaaa aatctataaa atcatttggg tttgtgcaca 
68761 tagttttact caccaaactc attacgtttt acctaatatt catattgaaa agaacgacca 
68821 aagaacaaca atcatttgca tatattcttc ttttttggac caaaccatca tatatctaaa 
68881 caacaaaaga aacgaaaatt gtatttagca accctaatat gcgcatattc ataggaaaaa 
68941 ctaataggct ttcagacttt agtttgtagc tactgataag tctaaatgct gaattattat 
69001 atattaaata caaaattaat gctaaggggt tgaaagttgg ttaagattaa agagaggtag 
69061 tagttatagc ttcaatagat taaaattaat taattcatta aaatcaagtt taaatgcaat 
69121 agtaactgag gcagttgaaa ataatgatcg ttacgtaagt atgataaata cgaatttaat 
69181 tcctccccta ataaaattag tataattttt attgtcaatt tttttttatg ttgtgtaatt 
69241 aactataggt caccaatttt cttctcttac cagacattta tcatcttaaa ttctcactaa 
69301 tcgcgtttat ttacttaatg caactgtcgt agcaattaaa cgaagtagat ctgtttgggc 
69361 cagaccaact tgcacagcct ctcctaatca gcccattaat tgttataaac gcttttaatg 
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69421 gtcaggaatg taaaatcaca gaccggttta 
69481 tttacattta ctggagatgc ttaaatcggt 
69541 cccacacggt gccagccacg gtgcagcaga 
69601 ataaggaagt accaggcgcc tgctttgaaa 
69661 ctagaaaaaa aaactgttga atacgagaat 
69721 ttcccatgaa aagcagagtt agaatcaatg 
69781 ttctgagaca agctaagagc agcatcaagc 
69841 ctgagaagtg actataagta gaaccggaag 
69901 agattcacga atagcctaac cgatcaacga 
69961 tttcgtcgac gagjbgttctg tcgtctaaat 
70021 ttttttcaat acctcgttaa atttaagaga 
70081 aaagcaggtt tcagacagta atcataaacc 
70141 gctgcgattg atgactgagg attagaacct 
70201 tgaagttttg ttcatcgacg caattgaaac 
70261 gaatagtgat caacaaaaaa gctcctcgcg 
70321 ccgtagcctt tccggtcaat ctcgcattaa 
70381 ccacgtcagc tcccttctct accaatcaga 
70441 atattcgcta cgatttagac ctgtttattt 
70501 ttttatattt taaaacagca aaatttatat 
70561 caagtattgt tgagctggac aaactgaaac 
70621 atttgataat tatgtaaaga atagtctgtc 
70681 gagcattaga aaaaaacgta cctaagaaaa 
70741 gtattttttt ttcactaact aagaaaattt 
70801 tttcatcact tccgttgtca atcggacatt 
70861 gtttgttttt ggtgtgtaca taattgttaa 
70921 atataatata acggcatcaa atgtaaatag 
70981 gacctattca agcaatatgc gggtttaatt 
71041 aagaagaatc taaataattt tctcacagtt 
71101 tcaaacttga atggattact ctctacacta 
71161 caagaaagag gagagagact gtcagacggt 
71221 tgatacttag attgttcaat ggtgacaaaa 
71281 tacttaaagt gtatctgata gttttttttg 
71341 tggatgtttt tttgttctgg aaatgtgtat 
71401 aaaactgtac agtctctatg tttcaagcta 
71461 tcaaactgat attgtttaat atcattgtat 
71521 tattcaaagt ttaaatttat tatcataaaa 
71581 aaaataataa taatttaaaa tagaacattt 
71641 ggacttagag agatattgat atatatcaca 
71701 agaaaattct atttttaaaa agaagatgaa 
71761 attaatagat ttttaaattc aaacaaaact 
71821 actttaaaac aatgtgttcc tattctgtga 
71881 aatcctatag aaataaatac ataatatatt 
" 71941 aaataagaac taaatccgag agattttttt 
72001 tattagattt gaatatcaga aaatgtcctt 
72061 tagagtttgt gaaaccctat atttgaaatt 
72121 caattttttt ttaaacttca aacttaactt 
72181 ttttatttgt cataattaat gtaggtccat 
72241 catattacaa caatattaat taataatatc 
72301 atacataatt aaatattaaa atacaagaaa 
72361 taatgctcta tcttcggtta ctcaaaattt 
72421 tttcagagca aattttccag acattagtgt 
72481 ttcttcatgt aacttttaaa aaaagttatc 
72541 aattttttta atttagtttt actataaaac 
72601 tatgaaatat gaaaaattgc agattaaaag 
72661 atgtatgtgt aattaattgt aaaaatcaaa 
72721 agtgaaactt caatctcaaa actctaaatt 
72781 aaaagctcta tattttaaag ttatagaatt 
72841 tcatctttcc ctcacagact aaacactctc 
72901 ataaaaaagc aacagcgtgt gtttagaatt 
72961 tatgtcaaaa aagaagaaga aagatatgta 
73021 tttgacaaaa aaatgtacat tacctcaaaa 



aagggtttaa ggaagaagag accgtgaaat 
tcttgatcag agtggctacg tgctgcgcaa 
atccatgagt ttgaccttcc ctctcgatag 
tctgcacagt ggagattcta agtgagacga 
gcgcttctgt cctttttgct ttgaacacag 
agttgccttg tctttgcaat ctgctgatgt 
tcaagaagcg gttggttgcc tccatttaaa 
aaacacaagc atgatcgatg accaatcaga 
gcttctctaa cagttcaatc agtctttgct 
aaagatacgt agcttattag attaattcat 
cgtaataaga tgcgccaata aacatgctct 
agaggaagtt tctaaagact agaactattg 
tctccacccg atctgagata ccaccgcctc 
aaccacagcc ttagtccatc cgaggcgatt 
ctgccatcat cggggccggg ggtgatgaaa 
gccctaattt cgactttgac cttagatccg 
ctttactttt tcgtatttaa tatttttttt 
tattatatat atatatatat atattttttt 
attaagattt ttgactccac gtgtacactc 
aaggcagatt tttcctcgta gtttcttggg 
aaaaaaaaat tgatagcttg cactatctat 
tttgtgtatt tttgaacaaa aaaaaatgtt 
gtttattctt taaaaacgta cataagaata 
gcatacaaag agaacttttg tctgtgtcga 
atcagattta gttttcttat tttcttaaca 
aataaattta gatggtttta tgtattttag 
agagttagtc taataatatt tcagtgaata 
tatacgaata ataaatccta aaataatttt 
aaacatgggt tccaacctga atggaacttc 
ggggaccggg tagcggacaa atagagtatc 
tgaagaatta gagtatctga tacctctttc 
tttttgactt aaaagtgtat ctgatagttt 
tttaaatttt gaaattttca tctgccacaa 
gtatattttc aacttataaa tatattattt 
taataataat taaatatatt taattatttt 
taataaaata aaacacaaaa ttttaattta 
taaaaaattg cagagagtgt aatttttacg 
tgattaaaat tagattaaca ttagatacat 
gatcttatct ggaatttttg gtttaactac 
tggtactgtt cttttatgat tttaaaattt 
tttatatcca attcaaatca tatatttttc 
ttgtattgta tttagaagta atctttatta 
tcgatatgta aatattaatt aaaattcaca 
taagagcatc ttaaaattga ctctttattt 
tcaagatatt tttttccaaa agtaaaactt 
caaaattatt tgtaatttgc tttatagtct 
aaaactttat aaataactaa catataaaaa 
taacacaaca atatcaaatt agtaatataa 
atacgacatt attacctaaa actatttatg 
gtttggataa taatttagag atttgagagg 
tgtaatattg aaatatgggt aataattttt 
aagtttcttt tgtaataata tatatttttt 
ttgaatttat aaaaaaaaat ctgaaataat 
gataaacaaa aaaatactta aaaatcataa 
atgcaaataa aaattgaaac ttctttgatg 
tgaagtttta aattttcttt ttatagagca 
gtttttacag atgctctatc aatcccactt 
taacaacttt gccaactcca ctatactaaa 
ctccgtgcgt gtaatgattt cctttttatg 
cttttttttt gctaaaaaga tatgtacttg 
acaaatgcat taaaagattt atataaaatt 
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73081 gaaacattac gcatgtattt tagattttaa 
73141 actgaaatat tagaaacgca ttaaagaaat 
73201 gaaaaatata aaactgaacg agaacctaat 
73261 cattaatggt taaatttgtt tatcagcgaa 
73321 atcaatcata tttgtttact cactagagag 
73381 tatctaataa tttttttttt aaatacaaaa 
73441 actgttccaa tttttaatac tcaaaatcat 
73501 ttaaaataaa tcgaaattaa taacaacaat 
73561 taaaaggatt tctcatcaga aaaggtttct 
73621 ttcagaatta aaaacctcta ttataatcta 
73681 agaaattacc ggactgtttt gaactcaatg 
73741 tataaaacaa aagaaaacaa aaagagttag 
73801 aggtataaga acacatggta cataaatcta 
73861 attaaaatca gaaccacgca ataagagcgc 
73921 agttttaata tgatttaatt attgattaaa 
73981 catataaaga caaacacaaa gacaattcca 
74041 aatctctttt ttttttcttt ttgtccactt 
74101 ataaatagtg ctctgaaact cttctagatg 
74161 taaaattgta aataccagtt agaccctatt 
74221 caatcttgcg aaaaatattt gtatcacatc 
74281 tatttgatcc ctctttctat aattataaaa 
74341 cgatctgaaa taaattatac caaaatcgaa 
74401 aatttaattg tcaaataata gttcaaactc 
74461 aatacttgaa tcacaattat atatatattt 
74521 caaatacact aaaatatttg gaattatcca 
74581 aaaggtaaaa ataaatatct taaaacaact 
74641 atatataccg tttctccacc caaatatcca 
74701 tatttggtta tacaatattc atatttatat 
74761 attttaaaat agatttggat tattgttgtt 
74821 aaaccaaatc tctaaataat ttaagttatt 
74881 tgtaattact taacatttag atttataact 
74941 caaaattagc aagagtgtgt gatctagttt 
75001 cgtacatagc aaaacaaaat ttaaataaaa 
75061 tggcaagcta agctctgttc tttctcttcc 
75121 gaagctgctc tctctctctc tctctctctc 
75181 tcttctctct ttcgctttgg atattttgat 
75241 gaacaggttt ccctcccttc tctctttcac 
75301 ctctctctct ctgtgtgatg gagatcgagg 
75361 ctccggaaat gaagcagagt tgggttctca 
75421 tttctgttgc ttgcatctca agcaacaaaa 
75481 ctgtgtcgtc tatcgtcgtg ataacactgt 
75541 caccaccgca agacaatagc accatcgcac 
75601 aaatctgtaa gtagagatgt tttgaatcaa 
75661 tcttttgatg ttctagccgg aaaactgcca 
75721 tcttgcttga acgatgggaa ttttcccggg 
75781 tacgatgctg gaggttcgat caagacaagc 
75841 agctggagtg tcattgaata tgggtcgaaa 
75901 aaagggctca ttaaatgggg aaccgactac 
75961 acgatctatg aaatggtgta tcaggtataa 
76021 caattcggtt caggtcctaa acttttgttt 
76081 agccaagtga ctagtgacct atactgctgg 
76141 tctgtcaatc tatgttacac agactgcccg 
76201 gcctcagcct cgatcgtatt ccgtgaccaa 
76261 gccaaagccg tttaccttta cgccgaggct 
76321 tgggatgacc tcatatgggg gggagcgtgg 
76381 cttgctaagg taaccagtca cgatctagcc 
76441 cgttatggtg tctttggttg ggacaacaag 
76501 aggcttcctt agttctgttt ttcttccaac 
76561 ttttttttct gtgtgccttt tttagttgct 
76621 tcccttcccg tatgaagaga tgctcagggt 
76681 ctcctacttg ccgtattaca ctaagtttaa 



tgcatttggt tttgagataa tgtacatttc 
gtataattta tataaaacaa atgcattaaa 
gcattacaga aatgtacatt accacaaaac 
gttttttaag atcatgtcac ttgaagacta 
accaatgttc cattagtgag gttaagagag 
ttaagagatt gaagacggag atattttaca 
atttactttt catctattgg tccatctatt 
aaaatatcat tatttttata ttttaggaac 
catctaaata atttttcata atacacacgt 
gagaatttga atggtttgaa gaaagaaaag 
attgacccga cgctgaggaa ataaagaaca 
tgaagtaaaa tataaatgaa agattattgg 
taaaaaaatc ttctttgtga atgatatttc 
ttccatcgat ttaaccgatt taatattgat 
ttttaaatca aaaacaaaaa tcaaaccaat 
actttttttg tttgttggag tttaaagcac 
ttctatttct tgttactttt taattaatat 
tccttaacaa taacgaaccg ttcaaaattt 
tatttatttt ggttcaactt aaaccattat 
aacttgtttt agttagtgtt attaagtcct 
aaattattct cctctaagcc taataaattc 
ttttttaata aaatataaga atggatcaaa 
aatctaaacc aaaatatttt ggatatccaa 
gaatatacta atatttgtag atttaatatc 
aaacacctga aaaataaaaa atttatccaa 
aaaacactca aaacattaaa aatataagta 
agccgaacaa attttcatgt caattttaga 
attatatatg gcattatttt tggttttgag 
tttacataat ttaaatatgt atctgaaccc 
tacgtatttt agactcttga aaagaaaact 
tcaattaatt atctacagaa aatttgcatt 
gcattataat taaaatagta tttttctacc 
catttcaaac taattcaatc tagctacttt 
tgttctctgc aaacaaatat cacgagagtt 
ccctctctct ttccggattt cttcccctct 
ctccaatggc tgccaaagcg tttgcttaac 
gtgtcttgtt ctggtgttct ctctctctct 
agctcgagct tacacctctg agacagcctt 
aaccaacact ctccaagaaa aagaagaaac 
ggttacttgt atggcttgga ggagtcttgg 
tcaagacact cccccaccac cagagcattc 
ttcctatggc attgaaattt ttcaatgccc 
agtatatgac caaatgaatg taaacgttac 
gaggggaata acgtgtcttg gaggggaaat 
agtctttacc cacatctggc gggagggtac 
ttcaccatgt ctttctcaat gacaatgttg 
tatgaggctt gtggagaagt ggaccacgtc 
ttcctccgca cttttagcag cagttctgat 
aagttttgtt tcccttcctt cttgcatact 
ttcttttcac agatagggat gaatcaagga 
atgcgaccag aagacattga ttaccaaagg 
catctcgctg cagagatggc agcttccctg 
gtcgactact ctgcaacact tgttaacagt 
atgagcgcca agcggaaaag cgcagatcac 
ctctactatg ccacgggtga taactcgtat 
aaccgtgccg gtgccttttc ccatggccct 
cttgctggga cacaggtttg ttagttcacg 
tgattgcttg cttgcttgct tactctctgt 
tttcactcgg ttgaggctgt tcttgagccc 
ctttcatgag caaaccagca tagtaatgtg 
cagaacaaaa ggtatcagat ttttgccttt 
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76741 gttattcttt attactaagt tttgattgtg 
76801 agtgagtttt tggtcttatg tgaagggtta 
76861 cgacttgatt tagtcaggat gttttgatct 
76921 tttcatatct gcaggtggtt tgatcctgct 
76981 gaatgcagct ttcctggcca ctctgtacag 
77041 atggtactgt gggccaactt tcttcaaaac 
77101 agtattgaaa ccattaccat tatatgatca 
77161 cgtttgtgtg tttcttcttc acgcaggttg 
77221 tgagttatgt ggtggggttt gggcagaaat 
77281 cgatcccgaa gaacaagaaa gtgacatgcg 
77341 gcgagaatcc aaacacgatc gaaggggcaa 
77401 tccacgacgt acgtgtaaac tacaactaca 
77461 ttgtggcagc tcttgtagcc tcttcacgag 
77521 tctcagccat aaatcctctg tcgtttgcac 
77581 atttactctg tttttatgct tttattattg 
77641 agctcaggtt tgtatctctt ctcttcttta 
77701 gtgcaagact tgattttacc aaacaacttt 
77761 caaacaacac acactcttta tagtagactt 
77821 acacatacct ctagtcttta actattacaa 
77881 tgaacttagt tggcttacta accaactaac 
77941 gttacataac taacttcctt gttgaccttg 
78001 ccacagtatt tgcttattct tcagaacatc 
78061 tcttcttgct tttctatcat cttctcatta 
78121 ttactcaagg aatcacttct caacaatatt 
78181 ggagacaaat ttataatctc ttctcaatat 
78241 aagttgatct ccttgtaatc cctgacgtct 
78301 tattttttct tcaaccttgc ggagagtttc 
78361 ccaaagctat cttagtgatt tcatgactat 
78421 tctcccacaa tgttttcctg atgttttgat 
78481 ttatcactcg.caatgtagat gccgcttcaa 
78541 caacatggcc ttacctgcag cctctctcac 
78601 cattccacca aataacaact taccacaact 
78661 taccaatcaa atgacattat ataatacttc 
78721 agctgatgtc tctctcggtg accattaaga 
78781 atgtcacact taacaagaga gaaaatcaat 
78841 gccaaggcat caatcaagat caagcaagtt 
78901 tacaaacaac cgagatccat ggcggctttg 
78961 gaggagaatg aaagagttag ttattcttgg 
79021 cgttacggta tcaacggatc gtttttggaa 
79081 tgcacattag tattaacata attgttaata 
79141 attttgaata atcatatttt gtgattttaa 
79201 aaatccaata tttataacta attgaactta 
79261 gaatattttt tgtactaatg ataatatata 
79321 tttaatataa catttaaaaa acaattttat 
79381 tttagtcatc tattatatca tagtgtaaca 
79441 acttatatta tgaaagcgtt ataataataa 
79501 tatgttatat aaattgattt tgatgtgtgg 
79561 tattgaaatt aatatgttaa taaaaatcta 
79621 taagaaaatc tattatttaa attagggaag 
79681 aaggaaaaga caaaacctcc ttaaatataa 
79741 ataaataaaa taaaaaaatt caaggctaat 
79801 gatgtttgca tttttcaaag cctctaaacg 
79861 tatttttttt ttaacttttt tctcaacaag 
79921 caaaattaaa aataaatcca ataaaaatca 
79981 tttaaattaa atatttgttt ataagataat 
80041 atgttatttt aatacataaa taaatataat 
80101 ttttttaaaa gctttataat tacagtgtat 
80161 taaatgtaaa attttaaatc atgtaatgtt 
80221 tggtcccaag ttgaaccagt taaaaatatg 
80281 aaaatatata caaaaacggt ttgtaaatta 
80341 ttacaacact attaatttat agggttttta 



tgtatacttg tgaaagtgaa actattgggg 
tttagttagg atgttttgtg aaggattatg 
tttttagtat tgagagtgta tatcgtctgc 
gagcgaacca gagcctctcc agtatgctgc 
tgactaccaa ggcgcttctg acgctcctgg 
tgagatccta cgtgactttt cgacatctca 
tctccacagt tcgatcttat ctgatttaag 
attacatact agggaaaaac ccacacaaca 
atcctaaaca tgtgcaccat aggggagcct 
aaggaggttt gaagtggaag gagagcacga 
tggtcgctgg accagacaag aaggatggct 
cacaggcgac tctggtggga aatgcaggtc 
gaggtggagg atttgataga aacaccatct 
cgccgcctcc agtacctgag acctgagaag 
gacttgtttc aagatatttg ttgtgatttt 
ctcttgaact ctctttttct ctgttttgtt 
ttcattcatt cttgcataca ttcttacaaa 
cttgctgact tacaactaac acagcacatg 
caactctctt ctcaactctc taatcatatc 
tcttaacttg ctacttaatt ttaacagcta 
atctttgact tctcttcttc cttgcttcat 
tcaaagtcat cgccatcttc ttcttcctct 
tgatttttgt cttccttttc ttttgcaggt 
gtttccactc aaatctgttc cttcactgaa 
tgtaacacac aagtggtagg aaagacttga 
ctcagagcat catatctcac actcagctac 
aagttcagaa acctgactat gtgtcacatt 
aatgctgatc aggattctcc tcagagaaga 
tcgatattgc ttccttgagc tctatttagt 
aagcttccag agcagagtag ccttgcagag 
agccatttga gtctcatcaa gagtctggtt 
tgacaactga catagcaacc tgttaccgtt 
ctattcaagt atgaatcccc tttttcttag 
aaccagtttg gtatatgaaa atcaatgata 
gacaatatca cacagagact ttaaaatata 
cggatcttca ggcaaaaccc taaattcaac 
atctgtagct atgaaaaaat cgataggaga 
ctttgagttt tgggagggaa ataaagtagc 
ttttcaaggg get t ate tea tgtgtgcggg 
taacatttta aaacaataat tttatttatt 
tcatctatta tatcacagtg taacatataa 
tattacgaaa gcgttatact aatgatttat 
tatttgtaaa ttaatatgta ttacataatt 
ttattaattt gaataattat attttgtgat 
tataaaaatt caatatttat aactaattgg 
tttatgaata aatgttttat attttattta 
tttttatttt atttttttat tattaataaa 
ttaggaaatt tattttggtt aatattcgat 
acattaaata cttaaatcta ttatttaaat 
cttcatttaa tatttgagtt gcatgacatt 
tcaattttat acttttattt taatagtata 
agactttttg atggctttgg aatttttttt 
ttaaacactt aaaaataaca aaatatttaa 
aaataaatct agataatatg ctatactttt 
gtattaatta atttegtgat tagaaatata 
aattatataa aaataaaatt atatgetata 
agtatattaa attagaaatc ctctatgaaa 
tttatacaaa caaaattget gaaatatatt 
ccacaaatcg ataaagataa aatatatatt 
ataattctat aaattaataa aatatcatag 
ctatatttat aacagttaca atttctaaat 
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80401 cttttgatgc cagtcaagct ttaaattttt 
80461 aaaatcccta ctgctaaaat tctaaagtca 
80521 agcttcaata tttttttttt ctagccgtcg 
80581 tattctagtc tccttttcct cgcagtccta 
80641 gggagtactc cattcaaatt ctctaaatta 
80701 aattttgaat ctagtgtaaa ttatgagaaa 
80761 tattactaga ctaacctcat taaactcacg 
80821 aatcttctaa aattatttta tatacctttg 
80881 tttgacttat ttgtcatctt cttgatttta 
80941 tttatttgtc atattttcca aaattttagg 
81001 ttttagtttt aattatttat catattttcc 
81061 aataatttta tcggtaattt ttatatattt 
81121 ttgggtttat tttataatca attgttaatc 
81181 atttttatta ttttgagaat aattaaaaaa 
81241 tatatttaat atattaattt taaaatatta 
813 01 aaatcactaa aat tact tat taagatttat 
81361 tacggagaga atgaaattta atattactaa 
81421 ttatcattag attttttaaa atattctcga 
81481 aaattattta ttaagatcga taaataatta 
81541 agcatgcaat ttaataatca atcaaatatt 
81601 atatatttat tacgtaaaat aatcttgaaa 
81661 tgatttgtat ttattttaaa aatagtacgc 
81721 gaatcgtttt agataatatt gattataaaa 
81781 actatattat atgatatata ttaaaatata 
81841 tgaatagggc gaaccactcc gtggatattt 
81901 ctcggtcaac tggaatgtgt attatccata 
81961 gacctgagac gaagaaaaat gtctatctat 
82021 aaaatggata acgacaacaa tttcatacga 
82081 atattcttca ttaatggaaa ttcttttgat 
82141 tttaagaatt cttgtttaga cagttcgtac 
82201 ccaactcttc catgtttcca tcgtagtata 
82261 gtgtttctac aattctacca cccagtcaat 
82321 gttacattac atgaatcata aacaaaatat 
82381 aatataagat aaatttaact atgatattaa 
82441 gcaataaatt atcacctaaa tcagtaaaat 
82501 atatgactaa acctaaaatt atggagagta 
82561 tttagggtaa ttctctcaaa taaccatttt 
82621 agtaaaaaga ccaaaatagc ctcttcttat 
82681 tttgaaaccc tattccaaaa cccacctctt 
82741 tagctaaccc ttagggtata aatgtatatt 
82801 tttcttcctt gatagctatt tttatgaaaa 
82861 ctctaaaatt tatatacaaa atttttttat 
82921 tgaaaacagt acatattcat aacactatat 
82981 atgttgtaaa atatttttgg ataacataat 
83041 aaattaatga aattcgttta aaatttatgg 
83101 taaattagat tcagtttaat attaccaaaa 
83161 cacacacaca cacacacaca cacacacaca 
83221 cacacacaca cacacacaca cacacacaca 
83281 tgctcataaa ttttataatc atcattattt 
83341 agatattaca tcgtaatatt ttaaaaatta 
83401 atatgaaagt tttcaagttt attttccgta 
83461 attgtatata aaaataatat ttaaataatt 
83521 tattttaata gtcttctgaa tttataatat 
83581 agcccgtaag tgcgggcaaa acacctagtt 
83641 atataattga aaacaaggag aaaaaaatag 
83701 tattgtaata gagtttaaat ctaaccataa 
83761 tctattttaa atatagtgtt attcaatgaa 
83821 ttattcaatt tttaaggatt gtaatttttt 
83881 tctaatgaat aaatgagtga actcaaaatc 
83941 tgaactttaa attttaaaat atgatggatt 
84001 gtttaataat agaatgactc taaatttatt 



ttaaaagtaa cagctacaac ttttaaagtt 
tagtctaaag cgaaaattgt tagcaatcag 
gataattttt gaattcaaaa taatatcaga 
caatctttct tcgaacccat gtttcaatta 
ttttagtttt tttctatcag agttttctta 
attattcaga ctgatcttaa ttcactggaa 
tattgcttga ataggtcaaa catacataaa 
gcactatcta tattattatt tatgtagtga 
tattttttaa ttatttttca ttgattgttg 
taattttctg atttcttatg tttttaataa 
acatttaggt tatttatttg tcatgttctt 
attatattat tatataattt aggtggtaat 
ttaataagta atttagggat ttagctgata 
ctaatgatag tatcatattt aaatttatat 
tatatataag ataaaaaata at tat cage t 
tattgattat aaatttaaaa aaactaaaat 
agtaaatata agataaattt aactatgata 
gaaaacaata aattatcagc taaatcagta 
taaaatatga ctaaatctaa aattatgtag 
aaagtttaca tgcaactatt ttaataaaat 
acattcccat atatataaaa ctatatacat 
tgtaaaaata tttttggata acataatata 
ttaacgaaat tcgttttaaa tttatggata 
attagtttaa tattactaaa cgtgaatata 
ttatttcaaa acaatataat tagaattaga 
taggggattt ttcaattgag aagatctatc 
tttatttagt tattcagttg attegttatt 
catgegtatt tttgattttc caatggattt 
gtagtgagta atagttatgg ttgttcgctg 
catccataca taatgttttg atctaatatt 
ttgttccatg gaactaagtg gaagaaacat 
ttcggctaat taatgagaaa cctttctctt 
aatatacaca tacatatatg tactaaagta 
aattagtttc tttaaatttt tctcaagata 
tatttattaa gattaataat taatcataaa 
ttaagtttaa taattaatta aacattaaaa 
aagtttttgt cacgaaaata gcactcaaaa 
tttaaaaatt tttattttta ttttttaaaa 
aactctaaac tctaaactct aaatgtagat 
ttacccttta ataaaaattt atttggtcat 
taaactaaaa aggattatct aatagaattt 
caaaatttat tttatcatgt aaagtaaact 
acattgattt atatttagtt ttaaaatact 
atagaatctt tttagataat gatgattata 
at aat tat at at t tat at at atacacacat 
ccaaatataa tataaacaca cacacacaca 
cacacacaca cacacacaca cacacacaca 
cacacacaga tatatcacaa aat at at aat 
aaaaagattt taaaattatt ttatccaaaa 
aaataaatcc atgtatataa ttttatgtat 
ataaaagata ttttgttgaa attaaaagtt 
attaaatatt ttcaaaatat aaaaataatt 
atctatttac aaaatttggt aagattatca 
at at tat at a attgaaaaca aggatatatc 
ttatttatta gggggtttta ttcgttgatg 
tctactgtta tttgaatgat gatttaacat 
tgatttaaat tcagttttta aaatctagtg 
ttgtattttc aattgatttc aaatcatgag 
aaaggtacac taaagagatt ttagaattct 
gaaaatcact gatcgaataa cactacctaa 
ttaattacta gtttagattt catgcatgtt 
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84061 atacccttaa gttcattgtt ataaaaaata 
84121 ttattttatt catattttat gtagccacat 
84181 ttaccttaaa atcgatagac aattagattt 
84241 tattaaatat tcatagattt aattagattt 
84301 gatagatttt tttttgtttt gaccacctat 
84361 tcaactgatt tattatcatt tgaccataaa 
84421 tttgatagat ttatttattc tttgaccaca 
84481 tactttggtg acagatttta tagttattca 
84541 aactggttat tgaaaactgg atctaaccat 
84601 gaactaaacc aaaatatttc taaccaataa 
84661 aatttaaacc aaagaagaaa aaaaccctaa 
84721 agcgagagat tacgtctcct atgtctttca 
84781 aaaattgttt gtggtatctc tggaatgctt 
84841 acgggaggca aatccttatt tccctatatt 
84901 tttttttaat tttatatatc aaacaaaaaa 
84961 tttatttgat ttatagaatt tgataaaaat 
85021 aatttcgcga aaatcatcaa aatatgattt 
85081 tttgaaaggt atagaactta aatatttgcc 
85141 gggttttgtt ttgaagagag aagatgattt 
85201 aagaattatg aaaaaaagat ataggagatc 
85261 gaaatagggg aatgggaatg ggaaaagttg 
85321 ctattccata gacagcaaga tgaagaacac 
85381 cactccaata tttgcatctt tccaatacta 
85441 cggtaggaac ttctttcttt ccaaaactct 
85501 gaagctgaac catgtcctgt gatcgctatc 
85561 ctgcacatca attagaggtg tcaactgggc 
85621 caatttagtt taggtaggct atgaacgcta 
85681 tctatggtct aactgggcaa cgaccaaatg 
85741 tatagttttc caatttattg aaaaacacaa 
85801 aatagaaaaa cataagttta cgattttagc 
85861 aacgtaattt tatagttttg gcgggaaaac 
85921 cttttatggt tttggtggaa aaacgtaaat 
85981 accattttgg catgaaaaac ataatattcg 
86041 agttttggcg agaaaacatg atttttttat 
86101 tggcgaaaac aaaataacta aattttggcg 
86161 aacatgattt atcggttttg gccgaaagca 
86221 tttccggttt tggcgggaaa tcacgatttt 
86281 gttttcgcgg aaaaaaaatt ctcggtttta 
86341 aaccgcgatt tctcggttta gacttacctt 
86401 agcatttgtg tttcttaatt gaattgtgag 
86461 ttggttatat tgataaacag tgctattcct 
86521 tcgtttttta ttaaaccata ttttatcttc 
86581 tctcgataat tcaagcttcg tttatgaaga 
86641 tttgcgttta tgatctattg ataatatgaa 
86701 ttttcataaa ctgatattag ttttattttt 
86761 agatataata tcttaataag agactctttt 
86821 gagttgtatc tttcataaaa tgagttaaca 
86881 aatgtgtgtg tctttaccaa catgttaaaa 
86941 aacaatcaaa tagtttatga gaatttattt 
87001 gcaattttat tttattttac atgaaaacat 
87061 gtgaagaaat acatccttta agaatactaa 
87121 tgtaacacat gtattttctt aaaaacaatt 
87181 atagcatatt tatatttata aatgagttcc 
87241 ttaataataa taataagatt gcattaatac 
87301 atgtctttgt ggccaaggct atgtgttaat 
87361 tttgaagcgc gggatatttt acgatgaaaa 
87421 taatttttaa agagtgtatt taaaatattt 
87481 cccgattgtg attataccgg ttaatccgga 
87541 tattcatatt aaaaaatcac taaaacccga 
87601 tatgtaatct aattggattt aaattgtaat 
87661 attttaaagt tcactatttt gcaatttatg 



gtaaataaaa atttgtcatt actcaaaaac 
acttatcttt atttttcttt ggcgatagac 
tcataaatat gttatgtttg gtaaaatatt 
agtttctttg accacatatt tactaatttt 
ttacttattt ttgacagatt tatttggatt 
tttaattaat tttgacatat ttaatatgga 
tgtgtaatta caatttgata gatttttgta 
gtcacatatt aattaaattc accttaacca 
taaattctaa aattttatat aatttaaatt 
attaagttaa ccaaatcata acaaaaactt 
tctacaaaag aacatcacgt gtcattcttt 
ttgttttcgt ttttcactca tggctctggt 
cataatcaag aactctcttt gcacactcta 
tatttcatat gtcaatctct acagggcaca 
tcataaatta aaatggaaac gtgaatattt 
atttcaaaaa tcaaaaaatt gttttttcat 
aaaacgattt taaaatatat tttttccaaa 
catggtttca ctaatccata gggggtgtta 
gatttgggag tttaagtttt ggtcaaaaaa 
aagtgaatcc attgaaaaat tgtctctaga 
actcctctac cagaatatga cgtgaggaag 
ataacaatct aaccattgtc ctcttctcga 
actgaagctg aaccatgtta ggaccaacaa 
gcgtgatcaa cattacaaat gtcctcttct 
acgaaattct gaaaaagaga gacgaatgag 
gggctgggct gtccatggat tagaccggtc 
attggtctaa ttgtacaaaa gaccaattgg 
gacaatgggt gtccataggc ctcctagaaa 
ttttaacgaa aaatcaaaac tttgcgattt 
ggaaaatgta attttacgtt ttggcgggaa 
gtaattttac ggttttagcg gaaaaacgta 
atacgatttt ggcgggaaaa aggtaatttt 
gttttggcat agaacgtaat ttaatttttc 
tttggcggag aaacttgata ttttaatttt 
gaaaaacgtg atttctcaat tttggcggaa 
cgattttccg gttttggcgg gaaatcacga 
ccggttttag cggaaaaacg tgatttctcg 
gcggaaaaca tgatttctct gttttggtga 
ttgtttgacc aaagctttgt gactctatga 
tcttgtgacc cctggtttgc atctcttcct 
ccttttccaa tttttgattt taatcctgat 
cgatcattca ccattcactg ccactaacat 
taaattattt gaaactttct atctaattat 
tctcatctaa tgttatttaa gagttatgtc 
tcaacataca actattaaat tttataaagt 
atttttaaga attgttcttt tccattataa 
ataataattt aaaacatgca cataaacttc 
cttttaagtt atgcggtatt tttttcatgt 
ttattattct caattttatc tctgtttaat 
gtatttgttt attcaaagat tttttctctt 
aaatgttgca aaatcgtatc cttccaaatt 
atattttttt taaatgttat tttataaaat 
cgtgtgatat cgcacgggtt ccttacctag 
atagtaact9 atggaaacca gtcgtagctt 
taatattatt actagatttt gatccgcgct 
atttcactaa taatttaaca aatattttgg 
ttgcatttaa atcagtattt ttaaattcaa 
gatctgacaa ttcaatttat gtttttaaaa 
gactaaccga ttgaactgat ggatgatcaa 
agtttcataa tttgtaatct tataatcgaa 
aaattatgac gtttctacaa aattttaaag 
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87721 agaaaatgat agatataaaa taattaagat 
87781 gtagtataaa aatatattgt ttggaaacat 
87841 tttggaaacg ttgatagtag tataaagaaa 
87901 gtataaaaaa gaaatattag tgatttaatg 
87961 ttaatttaaa aacttacaaa ataaatgtta 
88021 atagattact ctataataac caatggaaat 
88081 aagtgctgta tcaattttcc cttattgcct 
88141 tcgtttatag aaactatttc atttaaacta 
88201 ctagcaattt tttcaaaaaa aaaaaaaaaa 
88261 ttgatttaca tatcttttgt tcagatgaat 
88321 cttattcatt aatcggtttc acaaacactc 
88381 ttgctaaaac aaatagttat ttattataat 
88441 gttcttgtta tacttaattt ttttttaatg 
88501 gatatattta attccgcatc tatctaatct 
88561 tgatgttgaa gttggtattt catatatgaa 
88621 gggttcaccc cctaggatga acctttaggt 
88681 atctaatatc ttttaattaa gaaaacaaaa 
88741 aataaataaa aagattaaat aaataaaaat 
88801 aaaatattta tttataagat ttggagttta 
88861 caaatgttta gtgtttttcc aagggttagg 
88921 ggtttagggt ttaggattag agtttagggt 
88981 tttttccaaa ggtttatggt ttatccaagg 
89041 tagtattaga gtttagggtt tagtgttttg 
89101 ttatatacta tttttattta tttttaaatt 
89161 attatttgtt ttcttaaata aaagatacta 
89221 tgcatagggg gtgaacccaa aaataactca 
89281 gtctaacaga tttccagtca aattgttttt 
89341 tacatataca aaagattata ccatccatat 
89401 tatagtcaaa gttttgttat ggaaaccatt 
89461 gttaaaaaca aaatattata tcatccattt 
89521 tgcgttagaa tatagaaaat taattatgaa 
89581 tggaagatct atcatgaata tatataaaca 
89641 gaagaaaata caaaaaggta aatcaagtaa 
89701 catatgcatg tcaaggttag aagttttctt 
89761 ataagtattt aagcatctcc aaatactttt 
89821 tttatttatt catattgtaa aagcgtatat 
89881 aatttgtttc ttaagagtta ttacactaga 
89941 atttttttat ttctgtgaag gatacttgga 
90001 tcatggcaac caagaaaagt ttatctatta 
90061 taagcctttt tcctacaatt tcaggtaata 
90121 atgattttat taacttaagg caaatagaaa 
90181 gaaatcaaga agtttttaga tgattcgaaa 
90241 tatgttaatt tgctaatttt taatttgaaa 
90301 ataggtaacg aaggagaatg tcaacaacag 
90361 acctgcaaag caagatgtat ttctctgaat 
90421 gcaaatggcc cagacaaacc aattacttat 
90481 gtctgagact ataatatttt atcaatttat 
90541 tatttttatt ctaaattata tctttatata 
90601 tacattaatt aagccttaga aatttgaatt 
90661 atgttgtata tgtttttaga tctaaatgtg 
90721 aaaatatatt gaattattaa gaaaaatatt 
90781- tagaatgttt ggtttggaca tatccttgta 
90841 tttactaaaa ctatgtcata ccatataaaa 
90901 atcaccactt tcttacattt cacaaactat 
90961 ttgacttagg cccctcttac attagttaat 
91021 aaattagagt taagatttat gataatgtat 
91081 ctaataaagg taacatatac tcggcctccc 
91141 gggcaatgca ggagctgtta aaaccgaggg 
91201 gagcaacttg tgaaacttat aaatactgat 
91261 gatgaaattg cagcactgtc taaagagttt 
91321 actcagaatg cccgagcgga cggcttggca 



taattatttt attatttgga aacattgata 
tgataatagt ataaagaaat aagtatattg 
taagtatatt gtttggaaac atggatagta 
tatgtttaac tataaagtat aaaagtatat 
ggtccaacag aatgtttctg ttttaataag 
taaatgggta ctaaatcata taaaatataa 
cattacttga tattttgttc gtgaatatta 
tgtagagtag ttattgtagg aggctaaaat 
ttctagcaat tgccaaatct tttgttctat 
aatatatttt gcaaacccta tatatataat 
cccctttcta aagataaaat agtttttgag 
tctaataggc aattctcaaa aaaaaatata 
tcaatagaag aataaaatac aaattttaac 
tatttatttc gtatatgtta gtatactaga 
agattatata tataattatg actttttctt 
tcaccaacca atagaaaatt gttattttaa 
ataacctgcc aaattatatt atgcttttaa 
aacaatagtt ctcaataaag attattttaa 
gtgtttaaga tttatagttt agaatttatc 
gtttacccaa aggtttaggg tttacccaag 
ttagtatttt gttgagaaca tgtttagtgt 
gtttaaggtt tatgattaga gtttagggtt 
ttgacaacat taattttttt taattcgttt 
ttattttgaa aaaataatat aatttgacaa 
aatctaaaat aacaagtttc tattggttgg 
ttaaatcaat ttatagcaaa tactaaccta 
aaggaaaccc ttacataata cgaaagatta 
atagtaaatt ttaaattcgt tgaacaaatt 
acattacgag aaatatttta tatataaaaa 
atagtaaata ataaaaactg ttgaccaata 
tacttataac aaaatatgtg ttaccaaaaa 
caactagacc aaacgatcaa atcacaataa 
gaaagtcttt tttttttcaa gttagaaatt 
ttgaagaaga ctaaatttat ttctttgaaa 
tagtatacaa agttttaatt tttttctaaa 
gttttagtat atgtaaaagt acgtaatcag 
atctcgtctg atgttaattt tgaaatctgt 
caaagagttt ctgtaaccta cacatacata 
tgttgtctct gttaatggtc ttgactctca 
ttttttttaa ctttaggaag tatttatttt 
atattctacc aattcgaaat tctctgaaaa 
attcaaaaag actaaggagt tgagttaaaa 
gtaaaactaa tgctaattat attgacttaa 
gggctatgtg agggcgcaaa cccggaaaag 
tacaatctcg ggggaatttg cttaaaacat 
ttttgttgtt gcaagatata aattaataat 
aaaattatta gtttacaaaa gttccctttt 
aaataagaaa gtaattaatt gaactgtata 
tcatattatg ttttatattg tttggtgtat 
gttttagata tagttttact aaatactatt 
tataatgcca tgatgaatat taaatagcat 
tcaccacttt cttatattat gtgtcctata 
ttagaatgtt tggtttggac atatccttgt 
gtcataccat ataaaattat tttgagacat 
tatggatgat aggttttaaa attcattcag 
agaataagta ata'tatgaaa agaatgttta 
cactccatgc ggaagcagaa ggtgttattt 
aaaagagaga ttcagctgcg atcagactgt 
atggaatggc cagcattggc ctctgagttg 
atgagcctct ctatatgcgc aatcccgaga 
aaagggggac gtacacgcaa gctcatcccg 
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91381 ttcgttagtg actccgcacc tacatggcta 
91441 taatctttta tttttattcg atgtcaaaaa 
91501 atattatgat tttatggact gatacaattg 
91561 tgagtacaac cttctacttg taaaatatta 
91621 accgccttct atttctcttt ggctgtcctt 
91681 atatggtctc tggcgagtat taagtatctt 
91741 atgttttgat ttgaccatgt ctgttcggca 
91801 attttcttct tcttatcggg cctttgtaat 
91861 taaggtgaaa cctacctcca ataataatca 
91921 gtttattccg atttggatgg tatgtttgac 
91981 attctggttt attgaattag ttgtgagatt 
92041 cccttcttag aaagtcaatc ttcttagctt 
92101 cagaatataa cattcttgtc ttcttccttg 
92161 aatggtcttt ttctctttct atatattttg 
92221 tgtcattatt gtagtatgat tgttcttcgc 
92281 tttgagttat tctctttgac tgtttatatc 
92341 cttgagattt tgttcggttg tttgattttg 
92401 tatagtgatg tttttgcctt cggcatcgtc 
92461 tgatatcctt atacattttt ggtttgtttg 
92521 tggccaaatg tcattatcct ctccaaccaa 
92581 gtgacttgtg aatataattg gaaaaaacag 
92641 caaacaaaaa ggcagatttt aattattttt 
92701 agatgcaaac gaatttaaaa accaacgttt 
92761 tagcaacaat tatgatttcc aggcgcataa 
92821 tgggaccttg gcaaaactta gcacaattat 
92881 taaatatcat tagtaatgag agttctacta 
92941 acgagaataa aatcaaaaga gaggtccacg 
93001 agcttcacgc cacgtcagat gtgtcattta 
93061 acatcgttca aattcgagaa gcccgtgaag 
93121 tcctcttcgc gtcttgctcc tttcctatta 
93181 gaaacgtttg atgtagccct ggaaggaggc 
93241 gtcttctcac tgttaacttt ccagtaaata 
93301 gacaactacg cttccagaga ccgtagacag 
93361 cgaactgcag ttccccattc gttttctccc 
93421 ctcataacca tctttcctaa agctcattca 
93481 tttcaacttc cggtgctctc gattcttcca 
93541 ctgcaaccat ggcaaataaa acgtcctcat 
93601 tataaaacaa tgttggccct cgatttcggc 
93661 cagtcttagt gtctcttttc ctttcagtgt 
93721 tcatttgcag gcggagtaag acatagattc 
93781 attgcacaga ttgaagaagt taatgcagat 
93841 aaaagggaac attcagtata tctctcttag 
93901 gaggattgtt tgttagcgta tgttttcaga 
93961 tcagttactt gatcaaactt gagatcacaa 
94021 tcaagttcca aaggttagtc atcaacaaca 
94081 ttaacacacg gctgctctga aagccatccc 
94141 tgcagcaact gcctctatga aaactgattt 
94201 agaaaagtgg aggtactttt ctcaggattg 
94261 gattatgctt cttgactttt gcaggaatca 
94321 gcaggccgcc agagaaaaag ttgctgatcg 
94381 ttgttttttc aaatattttg attacagtct 
94441 actaagctgc caagcaaggt acaaggttat 
94501 gttgaaagtt gtttcttttt tgttcttttg 
94561 gatttatcct gtttactata ggctggtgca 
94621 tcatgataga agcaatagtg cacataaaca 
94681 acaatgaaaa aaaaacgtac aagtaagact 
94741 tgagtcgctt cgccagtttt gctctataac 
94801 tagctttgtt ctaaacttct aagtcgtggt 
94861 tcttgagtca aagagagaga ttgattgctc 
94921 tcatggctcc cgagaaatcc tctcagccgc 
94981 cagagagtca aaggtaggag gttatcttga 



gcccctgaag ctagtctaac ggctgtagag 
aaaaaaaaga tcttggtata aaatatatac 
tagtgaccca agtgcagata taagagtaaa 
agagatagtt tgtgaaatgt aaagcttagg 
tatgggtact gacggtggat tataatggcg 
taagatcaca tttactgtcc gatcttcttt 
cagactatcc cctcatcttg cgatagacct 
aattgacccg gttcttggac cggtatggat 
tccagtcctc ggt cat cage caggtcaggt 
gecgaaggtt tatgaaaaac ccagtaagag 
tgattgatgt aaatcatggg atttgatttt 
ggcaattatt tccttttgtg gtttctgtct 
gtttttcatt cttttaatca aaagaaaagg 
tttctatttt gaataatgtt cattttcttt 
cttctatttc tcaagtgttt tagctactat 
gaagtttatg tttctgtcga ggttcataat 
ttttttatcg tttgtttttc ttttagttta 
ttttggtgag atttctctcc tttgegtetc 
ttctgttgat cttcttattt catcatatgg 
tttttaattt acttattatg tagcaattat 
tggttgacca tggtgtgaaa acattgtegg 
gtgtataaaa ttacaatgea aaaggtaatc 
aaagatttga ttatgtgtgg aggaagtttc 
ttatgataat tgcacacttg tgcatgggag 
gataattgea cttgtcaata cataggaatt 
agaaaaaaaa gtaatcaaac ctatactaaa 
ttgacaaaaa aaateggeca ataggagtag 
agtcgttggg cttcgaattc cacaggcccg 
cccatttcac atctaagctc cctttcccct 
atcgatgtga aacagagtga aaagatctaa 
aaaaatttca tgctgatttt tgccgtgttc 
ttctctattc aatccttget tcttttgttt 
gaagaggagg agcctctgtc ttcctcaatc 
ttteggatte cctcttcatt gcctcccgat 
ttactgcatg aatcaaaagc caagcaacag 
ttcatcgcac ttacatagac taaatgatcc 
caaccagaaa ggcaagaegg cgtgcgttat 
taatttcatt tctgggaatt aegtgagtat 
atcatatatt ttatgaacta ataacaagaa 
tgeaatgetc tcctctctgc cagagaagaa 
gtgeagaaac aacatcctta aggtaatcta 
tttcttcatc aaattgaaca cgagacattg 
gagctccaaa tcctccatct acgaccctta 
atacaegcat caactttacc gaeaacgett 
tatccgtttg aacgttacta ttttcttatt 
agatatatcg gcgagaaaag aaaagaagtt 
agagaagaag aaaattgagc aatgtaagct 
tattatttga agcttggatt ttgaaacgga 
catacgagca aggaaatctt ttccacaaga 
agctatttat tttttaaatt gcacaacctt 
tttcattatt tatgettaac tgcttgagga 
ttgaatataa aagtccaaca gtgtaaggaa 
tgctatcttt aaatcgatga caaaacaact 
tgtggaacgg tacaatagac tatggagaat 
gattcaegga gttcaattgc taacacagac 
cctttccttt attgtttttt ttatcgttta 
ttttttttac attcattcct ctttaatctg 
ctctcttaac atcttgaact cttaattttg 
ccatgtctca tctgttgaga agaaaaactg 
cggacatgga gacaaagttg gatctagagc 
tccagcaaaa ttaatcaaaa agttcgtgga 
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95041 actttttttt tttattaaaa agatcgtgga 
95101 tcagtctcag ccagcacaac aatgaagtag 
95161 aggaaaccag cattcctcct gcgagctccc 
95221 actctggcca aaacaacaag ttccccgtta 
95281 attacaggat gcttttattg tgatttttat 
95341 tctgctatgt ttctagaaac aatattttaa 
95401 tttatatcat gttcattaag attctttgac 
95461 gacgtctgct acaaattgta taagctcaat 
95521 gtggagatct ttagacttcc tttttgttga 
95581 tttttaaggg cattcccatt atgttgggat 
95641 cactaaaaaa tgatactccc tctgtttcga 
95701 ttttaaaata agtgtcgttt tcggttttca 
95761 ctatttttct attggttgat atatggttag 
95821 aatatgtaaa actaaatgtt ttcttaatct 
95881 tgaaacgaag ggagtatact gtcaccaatg 
95941 ttttttttct tctcattatc ggtttagtta 
96001 taagtcctcc atttaaaaat ttaacacatt 
96061 tcatttaatg atgcatatta tcatacaaaa 
96121 attaaccaag cagaaaaaat agttactaat 
96181 aataaatatt ctcagaattt ttataaattt 
96241 gaaaaaattg ttaactaaat aagatccttt 
96301 tattatcttt acctttgtat cggttacaat 
96361 cttaatagca aacgtgaaag gacaaccgag 
96421 ctgcagattt tttcaactaa aaaaacaata 
96481 cctgagatac taacatgttc aggttcgacc 
96541 ttatcttagt cacctaacac ggatatatgc 
96601 agagatggtc tatccgttgg atgcacctcc 
96661 atctgggata tccctaggtt aataaaaaaa 
96721 caaacgatag ttgttgaaga ttaagtctaa 
96781 gtggctatca tattagactg taaaagacac 
96841 ttggagctct caaatagtgg atgcgctctg 
96901 attttgagct tatagaagat catgtaatac 
96961 aacttaaaaa aaaaaacaga gcacccaaca 
97021 atctgtgaac aattacaaca ctaacgacat 
97081 gaacgctgag atcccaccga ctattccaga 
97141 ctactctaaa tgtccactaa acacattagg 
97201 cctgaggcgc actatattgt tcaatacttc 
97261 catctacgcc aatcagccac taggaacaat 
97321 atgcttgctg aaggtcacta tcaaaccggt 
97381 aagaaacgat caacctccga tcattgctgt 
97441 cctgttccta tggagcagcg gtactatgta 
97501 tgcgatccca ataacatggc taaagtatgc 
97561 caatttgtcc actttgctac taataaagaa 
97621 gttaaccaca atgtattttc taaattcgtc 
97681 ttctattgca tatttaattt taataacttc 
97741 taaagtgaca acttaaaatt gtataaaaat 
97801 tccttaacaa cttaaaatta aaataacaag 
97861 aaataacaat gcaactaaga ctaaaattgt 
97921 tacgaacttt gacagttcaa tcaatatgtc 
97981 attttacaaa ttgttttact tcattttatc 
98041 aattttagct ctattttaaa aattgttttc 
98101 cataacaatt taaaaaataa atttcacact 
98161 cccatttaaa aatgtattat gcttcggctg 
98221 ttaaaaaaaa aatctaatat attctcatct 
98281 aaactttcac attgtacaat atcaacataa 
98341 cttaatatta ataaaagatt ctaaatctaa 
98401 taacgcggac aaagactcta gtatacatga 
98461 ctttatttat tttgttttaa ataaatccta 
98521 caaagttaat tgacagaggt tcgggctgca 
98581 gccgaaaaga ggtccaaata atataagpta 
98641 caacgcctga aaaaaagagg aaaatctaaa 



actttgcata caaactttga aagcaagatc 
gaattttacc acaaaaaaaa acaatgaagt 
atcctccaaa ggctaatgac caaactgttg 
taagcgaaaa acttatcaac aacgtcatcg 
tgttttgtcc tttagtttag ggttaaatta 
atcccattct ggatatcgtt tcggcataag 
gtcaaagttt aagcaaaaaa aagattcttt 
ctactatgta taacacccat tatctggaat 
cattggagta tagcagatca tat at gt ate 
eggaagecat aactgtgtta tgttcactga 
attatttgtc gttttagagt agaattttcg 
atgeaaaatt tattgacaat attctctgtt 
gtgtattgaa aatggtgttt ttattttgaa 
gtgtacataa acctagaacg acaagtaata 
ccgttcaaat gtcaaaacaa tctaattatg 
ttgtcataga tttttaatct aataactata 
cattaaattt taggtcatta attagtatag 
tgttttaatt ttggaaagtc aaaataaatt 
acaagaaaag gaaaatgata tattatttta 
attacatttc tgataatttc aaaattaaaa 
ttgtttggaa taatgetage ataaaaagct 
gagcattata acattgettg gacgattcaa 
atcaatgata gtatcgagag acaceggaga 
acaatcatgg tcgggtggta aggaggegga 
catctgctac ggaaaattaa gttatagtaa 
ccatgtttta ggatccattt gaatatccga 
ttttggggat tagtctaggc ttctccatgg 
aaacaatatg tttttttggt aaaataataa 
aagcagaaac atcactaccc cagtctttta 
aaatggatct tgatgtatat atatcaaatt 
ttacttcttt tccgccattt teegcttaga 
taaaacattt cctcaagaga ataagagtaa 
aaacagacca agtacaatgg gatgagaaaa 
tagtgatgtc tctgactacg atgattactc 
aacatcagtc tgegataett tgctgcgaaa 
ttcaaagaat tgtgtttaga gaacggcaat 
gtccacaaag aaaaacagac aggectcttt 
ggaaataata tgcatcttta cgttttgtta 
aaaaaatatt tggataaact ccaatggaaa 
gaaagaatca agaactcact aagtgcaata 
aatatggtca accttaagcc acatacaaat 
aagcaatgtt actaattcaa aatgettaac 
tgtttatttt aaaaaatttg tacaatattg 
cgtattaaaa taaaaattaa gtttagtagg 
aaaagaatga agtagtgctc tgatagaaat 
aaaaccatcg agtccttaac aactatcaac 
ttgtctaaaa aaaggaaaaa ataaaaatta 
gatgaatata taattagtat aaaattagta 
aagcacttca aaaacaaata aattgttttc 
tttataaagg aaaattttgt ccatttcaca 
ttagataaac ttatttacaa aagaaaaaaa 
tcaaccactt cctattccct ttactcatat 
aaataaatgc accgacttta ttttcatctt 
ttttcataaa tttaatcttt cccactgtga 
gaaaactcat ttaaacatac caaataaatt 
aaaaaaatat ctaaacctat tttccacgcg 
aatacaaagt aaacaagact ctaaacacac 
agaaattaag cttccactag cagtattcaa 
aggeccaata agtaacaaac atatgaagag 
aatccaaatc aatactgtcc agegtgaaga 
ggaagegtat aaagtgacgt gagaaatcac 
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98701 gaaatcacta cctatatata aactgaaacc aagagagaga gatccctctt ctcctccttg 
98761 tctccacaaa ccctaaagcg ctttatagtt tcagtttctc tcagaatcgg aatcgtttgc 
98821 ttcagtaagt tcctcttcca atcccttact tgttgatacc tgtttattcg taattaggtt 
98881 atgatctttt gattcggtta gatgatggat gattctctct tcgtaattag ggttcttatt 
98941 tttttcaaga tccgtgttct gcaagctata gatttatctg ttgacgtttg aaagtatcga 
99001 ttaatgcaat ttcgtctttg ttcgtaatca gtttgaagat cggaaaaaac aagaatgagg 
99061 attggtgttt ctgagccaga gcaatgcggt tgcgacacgt gtgtccagca tcgcacttta 
99121 tgcactcaag aaaccgaacc tagcaaagaa gtgactggct catcggttcc tgttagttca 
99181 gaaccagttc aacgtctcgg ttccacctca gatcagtgtt ccggaacaca tacgactcca 
99241 ctcgctcctc ctgaaccagc agcgcagtct gttgatgcat cctccacatc atcctccata 
99301 ttcagttctg ttagttccca accagcgcga gctctttgtc ccaccggttc actgccagtt 
99361 cctttatttg gttgttcatg gccacgtcct tgtagctgta ctggttgttc actgctaggt 
99421 ccttctattc gtcgttcatc tcccttcttt actgcttctt ccggatcatc aatctcctcg 
99481 tcaaggcaag ccaacgttac aaacagtttc ggttccgctg catccgaacc atctgtatct 
99541 gggccaatga aagctcctat ttttacttct ggctcttcaa ccgcttccac atcctcaact 
99601 ctaccttcat tagttactcc ctcggacatt acaagaggat cagtgcaagc gcctgtccaa 
99661 gctaacactt ccaagactgc ttctgatttt catccaccta acgttgccaa cactggagtt 
99721 tgcgctgctt caaggactag cacgaacaat ccatttccag gatttagtgt tgattacttg 
99781 cccagatgtc cctctaacct ttctcgacca aacgcaccaa ctactacacc agttcctggc 
99841 cctagttcag ttttggctgg tggtgaaact gaacaaggta gtaggtatcc tcgttatgcg 
99901 cctacaccag atgttgacgg caagcagatt atttccatat ctgcttccaa ctcacatgga 
99961 cataaaagtc atgaagagtt gaggtgggaa gattacaaaa atggagacaa aggtaaacta 
100021 actcacttac tcaaatcttt aaatacataa cgtattatag cattttatct tttagatctc 
100081 tctgtaattg actaatctca tgttgcaggt ggggttggga ggtttcatcc ggctcacaaa 
100141 cctcgtaaag aatcgaggtg ggataaagaa aaaaatggag acaaaggtaa cttaactaac 
100201 tcactactca gatgtttaat agcattttct gttttagatc tctctctcta attgactact 
100261 ctcatgttgc agctggggtt gggtcgtttc ctcctcctga tcatacaccg tcggtgttta 
100321 ctcctccaag catacctgat cgtcctcgga tgagaactat tgatctaacg aaccgagaca 
100381 cgagtggttt tcctattggc tacaacaccc ccgctgcttt ccagagcccc catgaacccg 
100441 ttggtgtttc ttccccagca tcaggatgca cagcgtgtgg agccgcgagt agctcctctc 
100501 cttcgagtca cttgggcttg aacagtacca caaatcctcc atcatctgcg acatctcttc 
100561 ccgggctgtt cttttccacc tatggttctt ttcctttgct gtttgcaaca ccaaatcttg 
100621 cagcttatgg tacaactcca gcagtccaag cctatcctat gatgtttgga ataccaaatc 
100681 ttgctgctca aggtacagca actccatcag ttcaagcgta tcctatgata tttggaatac 
100741 caaatcttgc tgctcaaggt acaacagcaa ctccagcttt tcaagcgtat cctatgatat 
100801 ttggaatacc aaatgttgct gctcaaggta caacaacaac aactccagcg gctcaagcgt 
100861 atcctatgat gtttggaata ccaaatcttg ctgctcaagg tacaacaact ccagcggctc 
100921 agccctatcc tacgatgttt ggaacaccaa gtcttgctgc tcaaggtaca acaacagctc 
100981 cagcagttca gccctatcct acgatgtatg gtacaccaaa ttttgtagct caaggtatga 
101041 ctccagcagc tcaggcctat cccgttaatg gttcaagtct tctcccattt gccgccatga 
101101 gtctgcagta atcgtgcctc gctctgcatc aaagttgtgt cttcttttat cttgtagttc 
101161 taaggtcttt gttttgctta agagtcataa atcaagtctc gtccaacttt atgaattttt 
101221 ttttctaagt aaagattcag ctaatcatgg atgtaaacaa gtctcttaaa acatcattat 
101281 tttgctttta tcaataaaag ctcctttcct tctactgtaa tttttctctc ctctgttcta 
101341 ataaacaaaa aacaacaaaa agaatcatgt tctttgttat ctctcactaa gccatgcctc 
101401 aacatgaaca catgttgcaa aggcatactg agaaacttta caaaaacaaa ctgcaaaact 
101461 ctattctctg atctatccca accgcacaat caacacaata tccgagaaaa ataaaatttc 
101521 atgaacatat gttcttgtca acgcaagcta ctgtccaagt aacacttcca catcctttag 
101581 cttcggtggt tgtagggcat ttgttgcaag gattttggaa ctgaacaagg tagtatatgt 
101641 cctcgttatg cagctacact ggaaggacgt aacacatgta actgcttcca actctcatgg 
101701 acataaaagt taaggagagt atgaagtaga aagattacat aaaagggaaa caaaggtagc 
101761 tgactataac taaagattta acaacttaaa agaaaagaac tcttttcctt ctactatatg 
101821 ttctcttatg tcctatgatt ttgtgaaaca acaaagaaca cacaaaaatg tgtggcatga 
101881 atcattgcta cactctccat tcccaatcct ctgtttttgt gttttcagac atccaagtta 
101941 caaatgcgtt aaaattatga ttttctcttc aagtgaaatc atc.tgatgat ttcatatttt 
102001 acatattcct tctcatgtgg ggtttataga gggatatttt atgctgaatc tccccctctc 
102061 tttagtgcta attgttgaat cattagaaaa ctgcaactca cagatacgga gtaatgtcca 
102121 aaacaaacat agatgctgca ttaacattac aaatattaag ttcgtctact tcatttgata 
102181 cacggttaat cattttaata tctaaagata atacttcaaa cacggcgatt ttattt.acta 
102241 cccaataaaa atgttagcgg agttaatgtt caccacaaat aaaaatttat tatcaatcct 
102301 atacgataac tatttgttta tctcctcttt ctcaagcata tatatacttt tatatgcaca 
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102361 tttacagtat acattattta aaattaaaac aagaggtaaa ctacaaaccg aacttatgtc 
102421 caacatgaaa agttttttca acttccatga atcaatcgca tcacacttag atactgtagc 
102481 attgacaccg gtagcaaata catcacatat cacatgcatt tcaaattata gaacaaacat 
102541 ataaaaaagt tagattgtgg ataaacaatc cgctcgtagg gcagctttga ttttagagag 
102601 atttcctagt gctattctta cagcttgcca tactaattaa tgttcccctt tgctactgat 
102661 agtatatagt agtatactaa atttctaaga aggttaaact ttcattatta ttcaggaggt 
102721 tcagcccgta aggcccataa ctaagaaaca catagagagc ccaagtccac tagacactta 
102781 atttagctat ttcaaatcaa tttaaaaaga agattataag aaacgtccgg cgtgaaatca 
102841 acgcctgaaa gaaagaaaga aaagaaaaaa aaatctggaa gaagcgtaca agcgacgtgg 
102901 ggaaatcacg aaatcattac ctatataaac tgaaaaagca gagagatccc ctcttcatct 
102961 catccctcta ccgagtctct ctacaaaccc taaagctcga tttcaagttt ctcagactcc 
103021 gaatcgttac caaagtaagt gccctttcta tctctcatct cttcctttaa tactttgcag 
103081 aatttgattt cttttccgcc aaaagttctg atctttgttg aatactggtt agccctttat 
103141 tcgtaattaa agttctgatc tttgatttgg gtttagatga tgatattacc tcttcgtaat 
103201 tagggttctg agtttttgtt tttttttttg tttttgataa tatctgcgtt gcctctgttg 
103261 caagtaatta gggttctgag tttttctttt taataagcta tcgattatct gttgatcgat 
103321 tttaatataa atttcgtctt tgtttggtat tagtttgaag atcgagaaac aagaatgagg 
103381 attgatgttt ctgagccaga gctatgcggt tgcgacactt gtgtccagca tcggactttc 
103441 attactcaag aaaccgagcc gagcaaagaa gtgattggct catcggttcc tgttagttcc 
103501 gaaccagttc aacctcttgg ttccacctca gatgagagtt caggaacaga gacgactcca 
103561 ctcgctcctc ctccagtcac cacaccggtt aataatcctg aaccagcagc gcagtctgtt 
103621 ggctcaacca tcccacctgc tgttacacca gttagttccg aacaaccagc acaagctctt 
103681 ggttccacct cggatcaaag ttccggtaca gagaccactc cactcgctcc tcctatcacc 
103741 acgtcggtta agtctgttga ctcgaccatc ttcttcaagt tcccaccggt acaagcacaa 
103801 gctcttgccc ctactgcttc cggttcaacg caagcccctg cttttggttt tggtgcattc 
103861 gctgctcgcg taccatctgc cacctccggt tgttcagcat ttagtttcgc ccctcctgtt 
103921 acatcggcac cagtgcaagc tctaggcaca accactacta ctactactac tacatccgcg 
103 981 gccgctcctg catctccatt tcacagttcc tcaccaacca cattccaatt ccctcctgct 
104041 tttacatccc ttgctgcttc tacttttcct tctgttgcat catcaacttc atctccactt 
104101 gatgctcctc cctcaccatt tagatgggga tcactgcaag ctaacacttc cccacccttt 
104161 agcttcttgc cagcgcaagg ttctgacaag actggttctg cttttactcc accgtttggc 
104221 taccctggtg gttttgccag acctgatgtt ggtgtctctc atccagggtt tggtccctct 
104281 aaccattttg gaccaaacgc accaactact acacctgttc ctgttcgcag tccatttttg 
104341 gctggtggtg gaactgaaca aggtagtagg tatcctcgtt attcacctac accagatgtt 
104401 gacggcaggc tgataatgtc catatctgct tccaactcac atggacataa aagtcatgaa 
104461 gaattgaggt gggaagatta caaaaatgga gacaaaggta aactactcac ttactcaaat 
104521 ctttagtaca taatgtatag ctttttatgt tttagatctc tctgtaatta actaatttca 
104581 tgttgcaggt gggtttgggt ggtttcctcc tgttcataca tctccctttt cctcaccaac 
104641 ggtatcaccg tcgctatttg ctcctccaag catacctaat cgtcctcaga tgagaactat 
104701 tgatctaacg aaccgagaca tgtgtggttt tcctattggc tacaacaccc ccgctgcttt 
104761 ccagagaccc cctgaacccg ctggtgtttc ttccccagca tctggatgca cagcgtgtgg 
104821 agccacgagt aggtcctctc cttctagtca cttgggcttg aacaatacca caaatcctcc 
104881 atcagctgcg acatctcttc ccgggatgtt cttttctacc tatggttctt gtcctttgct 
104941 gtttggctca ccaaatcttg caacttatgg tacaacagca attccagcag tccaagccta 
105001 tgctattatg tttggggctc caaattttac ttctcaaggt acaacggcaa ctccagcttt 
105061 tcaagccttt cctattatgt ttgggactcc aaatcttgct gctcaaggta ctacaagagc 
105121 tccagctgtt caagcctatc ctacgatgtt tggcacgcca aatattggag ttcaaggttc 
105181 aactccagca gctcaaacct atcctttgat gtttggcacc ccaaatcttg ctgctcaagg 
105241 tacaacaaat attggagctc gaggtacaac tccagcagct caagcctatc cgttgatgtt 
105301 tggcacccca aatcttgctg ctcaaggtac aacaactcca gcagttcagt cctatcctac 
105361 gatgtttgga acaccaaatc tagctggtca aagtacaaca acaactcgag caggtcagcc 
105421 atatcctacg acgtttgctg ttcctcaagc tgcgacagct ccagcagttc agccgtatgc 
105481 tatgatgttt ggtacaccaa gtctcggagc tcaagatatc actccaggag gtcaagccta 
105541 tcccgctcat ggtttaactc tcccattcgc cgccatgagt ctgcagtaat tgctcctcgc 
105601 tctgcatcaa actcgtgtct tttatcttgt agttctaagg tctttgtttt ttccttaaag 
105661 agtcataaac caagcgtcgt ccaactttgt gattttcagc tatccatgga tgtaaacaag 
105721 tctcgtgtct tttatctttt agttctaata tctatctttt taaaacacaa acattatgtt 
105781 ggacctaaca tttattttgt aagtttttaa attaaataca catttatact ttatacttaa 
105841 acatatatta agttactaat atttctttct ttatactact atccatgttt ccaaacaata 
105901 tatttatttc tttatactac tatcaatgtt tccaaacaat atatttttat actactatca 
105961 atgtttccaa ataatacaat aattaatctt agttatttta tatctatcat tttctcttta 
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106021 aaattttgta aaaacgtcat aatttcataa attacaaaat agtgaacttt aaaatttcga 
106081 ttataaaatt acaaattatg aaactattac aatttaaatc caattagatt acatatcggt 
106141 catccatcag ttcaatcggt tagtctcggg ttttagtgat tttttaatat gaatatttta 
106201 aaaacataaa ttgaattgtc agatctccgg attaaccggt ataatcacaa tcgggttgaa 
106261 tttaaaaata ctgatttaaa tgcaaaaata ttttaaatac acactcttta aaaataacca 
106321 aaatatttgt taaattatta gtgaaatttt tcatcgtaaa atataccgcg cttcaaaagc 
106381 gcgggtcaaa atctagtttc tcgttaaact cgctgctcaa aataaatttt catgaaccta 
106441 ttttcttgtc aacgcaagca actgtccaag ctatttttat attacatgtt ctttctctgg 
106501 gttgttacaa cactatcatt ttgatattct ttttgaatga atagaaaaac caaaggtggg 
106561 aagtttacaa ctttgtcaac tctataattg tgttattaag gattccattc cgctaacaat 
106621 tttagtttgc ataatagttt tttctctgaa ctggtacaac accagtaaac aaatgattgt 
106681 cttctaaatc gcaggactaa gtctctgttc tcctacattg actctaatta aacaaatgtg 
106741 tcaacttgat gatttttact tggtaaagac atctaccaat cattgatgaa aacaataaca 
106801 tttcatgcac attattcctc tgaattagta caacatcttt ctttagtaat tcttttgaat 
106861 aatattcaac aagaagaaaa ctccttttct ttctactata aagtctctat cattctctca 
106921 tgcagaacgc acacacaaaa taaaatggca agaatcataa ctacactctc tacacctctg 
106981 tttttcttct tcctcttctc cctgctctct caccaaacca tttctcaacc cgaacacttg 
107041 actactttct gcaacccttc cgacaacttc acacaaacca gttcatacga agcaaaccga 
107101 gaccttctac tctcctctct ccgcgtcagt tcctccctcg gaacctattc aaacgccaca 
107161 gtcggtcgta gtcccaacac agtccacggc atgttcctct gcagaggaga caccaccgca 
107221 gcgtcttgct cagactgcgt ccagaccgct acaatcgaga tcgctacaaa ctgtactctt 
107281 aacaaagaag cggtcatata ctacgaagag tgcatggttc ggtactctaa tgtttccttc 
107341 ttctctgttc ttgaggtcag accgagcatc gtcctttact ctcttcgctc tgctccaaac 
107401 tcgaatacgc tcaatgaaac gttagctgat aaattcaacc aactgattct caacgtgtct 
107461 tcgtcctctt tggttccgta ttttctggaa gatcaagaac ttgtgactca agcagagggt 
107521 tcttataagt ttgagtcaat ggttcagtgt agtcctggtc ttgaccggtt caactgtacc 
107581 gtttgtctca gatttgcgct cttaagagtt tcaacttgtt gcggttcacc aagttctgct 
107641 ctgatcttta ctcctaaatg tcttttgagg tatcaaacct ctgttttgtc gtcgccgcca 
107701 ccgttgcctc catcctctcc acctcctcta tcgttgccgc cgcctccacc gtcaccggct 
107761 ctatttttac cgcctcctgg tttatcgcaa ccgccaccac cgccgctggt tttcacgcgg 
107821 ccacagagca gcgggtcgtt ttctaacgtt attaaaggtg atcatatttt tgtatgtatt 
107881 tttgtttagt tttgatcata tgactttctt tttcaaaaaa tttgatcata tgagtatctt 
107941 tttgatataa tgtttatgta tccaactaca aatattattg acagggttgg ctctaatttt 
108001 attccatttt ttttgtggca ggaaaccaaa tatttgggag aattgctatg acgatggcag 
108061 ttttggtgtt tgcacttgtg aatttgtgaa cacatttgat gattttggat tgaaataatt 
108121 tgcaattcct tttatgtttg tcatgtgttg ttgttgtctt cttaagatca aaagccacaa 
108181 agtatgtgaa tattgaaaat catgtttctt ttttgaacat ttaaccatgt tgattttgaa 
108241 catctaatca ttttgatgtt tgtttgcatt tgatcaacaa aagaagaaga gttggaggct 
108301 gatatatcaa attataatta atttctgttt tacatactct tttgctatat caatactttt 
108361 aattttccag catgaccaat tgataactgt ttagtccatt tatttataat catgttttcc 
108421 taataattaa ctgccaatat ttaacatata gatatctgca aagacatatt agattataac 
108481 tgctactaca ccactacaca ttttaagatt atcacgaaaa caacttctgt ttttgaatgt 
108541 ttaaaaaaat caaacacacg cacaaatata ttaatatgct atacaaattc ttatataaaa 
108601 ttatggtttt atggtatgtg ttatacttta gcataactgt atgtatataa atatatatgt 
108661 atattattta tagtttgtat tgcaatattt ttaataaact aaacaattat tatatttaaa 
108721 atttaaaata aatcttcaat ttgtattcat aaatttatga ttagaaagtt aataataaaa 
108781 tagtaataat taatttcaga catatttaca gaaagtttaa aaaaatcaaa cacacgcaga 
108841 aatatattaa tatgctatac aaattcttat ataaaattat ggttttatgg tatgtgttat 
108901 actttagcat aactgtatgt atataaatat atatgtatat tatttatagt ttgtattgca 
108961 atatttttaa taaactaaac aattattata tttaaaattt aaaataaatc ttcaatttgt 
109021 attcataaat ttatgattag aaagttaata ataaaatagt aataattaat ttcagacata 
109081 tttacagaaa aaatacaaaa caacactata tatgcgcttc aagtacagat caagatttaa 
109141 tataggtcta acattttacc ggacccgaaa aaccaaacta gaacggatct gaaaatacag 
109201 gttcagattc gagtttatgc taaaatattt attggatttt ttttatggag atccacaagt 
109261 ctcggtttgg atctacgtcc tacaagagac ccaatcaggt attcgaaata ctcaaaaatt 
109321 attatatatt aggtaaattt ggatgattgt gtgtatttta gatatttcag atattttttt 
109381 aaatttcgga ttcaggtttt taggtataat ttcaggtttc gggtaaattt tagatttcca 
109441 aaaaatataa tttgggtgtt cgggtaaaat ttgagatacc tttcgagttt tcggatctga 
109501 ttttgtgtaa gtttcgatat attttctcca tgtattttta ggatttgcag gtactttttg 
109561 agttttcaaa ttcagtttgg atatgtcagg tttttttcgg gtcctatata cccaaatcaa 
109621 cccgaatccg aaataaaact gaaattatga atgttttacg tatattagaa ttatatactt 
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109681 gaaccgacaa gagttcgaca agctcctggc cgagcagacc taaaaaacta tagatatcta 
109741 tatggatcta gatcttcagg aaccgaaaga tgcggaccca ataagacttg atccgaggac 
109801 cggatgccta ggcctaatct gaatagataa atatgattac aaaaaataga tttatagtta 
109861 taaagaaacg agtatatgcc ggccaacttt aaattactat tatttctaca gtaaatcaaa 
109921 taggttttga ttaagattta tacacatata tgactacaac aaaatatgat taagaaaaaa 
109981 acacaagtat tacaattaaa tatctagaaa aagttaaaca aaaaatatac atgtcttttt 
110041 gaaagggcgg gtcagaatct agattctgaa aacccgaatg ttctcaaata tttaaaaaaa 
110101 tagtttcatt taaaataaaa aatcatataa actttttaaa atacataaaa acagatttta 
110161 tcaatacata acgggtctat gacaaaaaaa acagtttaaa aatattaaac tttttcttta 
110221 aattaaaata ctaaaaataa aatcactgtg aaataattca caagtttaac agcaggttaa 
110281 taaacaaaaa tcataaaata cgaatcagaa ttatttggag tcttcaaaat ttttgtcata 
110341 tttaaaataa caaaaattat tcatacaata attttagata aatttcataa aattcttaac 
110401 aaaatataat ttatcaagta taaaaatatt catgacataa aattaatatt ttcaacaaaa 
110461 atatacctgc agaatctagt gtactaatta aatctgcata attaccctta gtgtctattt 
110521 ttttaaaaga aatcagcatg caacaaaaat tcgttggtag atggactgat tcgcaacgaa 
110581 tacatcacaa aatcttagtt tgcagtttta cgctaagcaa ttatcaaact aaaacatcta 
110641 atctagtggt ctcatgcata aatgcatact gttaaatatt gaatgacggt ttaccaaata 
110701 aaaagttatg cggtcgtcga tatcatactg atatttcaag ttaagataac tgtttttttt 
110761 aaagcacact tgtttttagg gtttatcaac ttatcattct gtaagcgcaa tatgtacata 
110821 tattagcata ctcggtcaat gttatatgtt ttcattgtta aatgcagtct tctccttcct 
110881 tcgtttgggc atagtattta acttttatta gtattatagt ttatcctagt gaacccaatt 
110941 aaagtaagtt tgatagatgt caaccctttt aagttgctta aactaatcaa cacttgaaag 
111001 ttcgtcacga aataatccaa tacctatctt ttgagtgttg cagccatgct agtttggctc 
111061 agatcatctt gtcatgtata atctccgctt tgcaaccgag tctgggcgtt ataaatggtt 
111121 agtagacaaa ttcttttatg caatgcaata tatatatgcg gatactgata tattgactct 
111181 gcactggctc taaattttag gttacatacc ttatccggaa tcatttgata tacaacttat 
111241 cgctttgtac taccattttg gattttgtga ctggctgacc cgtttctgtt agtcttgcct 
111301 caccctgtat gtgagatgtg gaataagcaa cttattaaac ccaaacacta 'atattttggc 
111361 ttattttata ttgcaggaat cgaaatatat tggagaattt tcattacagc catggcaact 
111421 tgggtgttgg cacttgtggg tctatgatat aaagttttat ttatatttgt attccacaat 
111481 tgtatgtgcc gctattttct tgtatagtta ataagaagca acatacttgt tttctaagtt 
111541 atctcaaaaa agtttgacgt tcgatatgtt tactaaataa aatcctaatg catcaccata 
111601 taccttcttt actgtaagaa tgaatcgtcc aaaaacaact ctgtgtttca gacaacaaat 
111661 gtgtcaactt catgttttta ttagcttcat atggtttaaa ataatagcca gccggaaggc 
111721 tgcttaacgg aaccaaaagc tattatgaca gaaaaaaaca ggtgaaagag gagttaacat 
111781 cttgcacaag tgattaacga agtcttattt gcttgagaga tcaaggattg aaaccttgat 
111841 tttgcaaatt tattgctctt caatataaca tcaaaataac agaaaatctt gccattacta 
111901 tgttttcaaa ggcttacata caaaagagaa aaaaaaagct tatcaaatga gaaacgttaa 
111961 ggaatatggc acagattaaa cgttcccctt agtgtgagct caatctattt gcgtacgtgg 
112021 caagtatttc aaggaacact gtttttggtt atggcttttc aaaacaattg ttgcttaggt 
112081 ctagggctta ccattgccaa ttttatgtgg aaatgctgaa gataaaacca aagcgtaaat 
112141 tggcgtaacc acttctttta ctgctgtatc tgcctgctac aatgaatgtt tgatgttcat 
112201 agaaaattaa gggaaacaga ggagggacgc tctaaaaagt tcctactaaa ttaacatgta 
112261 atccaacctt caaaacacac ttgtctttat tctttacatt gttaaagcaa tcaatacatt 
112321 ttcgtttaat tctccttgga aatggttggc ttctccatga acttggagac aaccatacca 
112381 atagcaaaga cgagaggcca cactgatact ttgacaggac cgtctcgtcc cctgtcagta 
112441 aagctacaat tgatcgtgga cactattaac tgaccaacac cgcagaaaaa caagatagca 
112501 tccgtggaac ttccaaagat tctgttgtag ccacgtggtg atggatagta gaaccaagga 
112561 aaaggcaact tgcaacggaa acgagcttta tgggctcggg ttttgtaaga gaagtcaagg 
112621 atagatagga taagagaaag agttgacatg actaacgaga tcaattgaag gaagacagac 
112681 aaacaaacaa gtttaggagg tgttccccgg ctatgatgcc cacataggtt aaagcttgtc 
112741 tccatctttg ctaccacgac atgtttcctt gaaagctttc ccgaaatctc cagtctgttt 
112801 cctaacatcg gatggatcta ctccgtagaa aactgtcgtc acgatttgtt ccttgcactt 
112861 caaaatctcc agcaggtttt tcaaacacca gctggaagaa gcataattct tggaaagcac 
112921 cacgatcgat atcactgatc ccctaatagc ttggcctctc tcagtccctt gatcatcgaa 
112981 cattaaaatc cccttgctgt ttaactgtct ccgcaaatga ctgataaaag tcttagggac 
113041 gtcgggtccg tagaaactcg caaagagttg gtagctccaa gtgcgagtcc taaaagaaga 
113101 agaagaagaa gccatgggat aaagtattag ctagggagag agggttacaa gatcagatat 
113161 acccaaaaaa aaagtctggg gataagccat atatgatcca gaaagcgcca gtgttcactg 
113221 tttaataaaa gaccgaacaa gggttttttg gcattttcag tagccaacaa acttctttgc 
113281 tctttctcaa aagtgtacaa gacttccttg cacttcaagt ctctccaaca gctctatata 
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113341 agctccatat ccatgaagca ttgttctttg 
113401 gtttttagga gcagagaaga gcgggcatct 
113461 atcttataga aaacctcaaa agcttacgtg 
113521 agttaattag gttttacata gaagaagaac 
113581 agctactaaa caccaacagc tcagattcat 
113641 gatgaacctt gagtgagtct tcaccgcctt 
113701 ctaatcgtcg gtcttttagc acctgaggcc 
113761 caagattgac actaaagacc atataatgag 
113821 atccatatgc atagtttaca caaaagatca 
113881 tttaggatac aagttactat gcaaaaaaga 
113941 ggagctagtg gcaattgaca gacgagaagg 
114001 ctgcttcgga atcttcgatg gaatactagt 
114061 acataagaat ctaaaaaaat atgtattcat 
114121 tacttacttg tctttgggca cggttttcac 
114181 gaccaagtgt ggttcaggtt cttccgttat 
114241 agtcaactcg actgcactgc atatggagcc 
114301 atcagagata tcactcaatc tgtcctcaga 
114361 gtcatcttca atgtcaattt cttttgaccc 
114421 ctttcgatgt ttaagttcgt ccgatgaacg 
114481 gtattctgag cagttgtcaa aatctgaggc 
114541 tcgtcgagca tttggtgaag gtcctataga 
114601 tcttagattg cttaaaccac gtttcggcac 
114661 attttgaagc tcttcatctt tcttggcaat 
114721 caaatggtat caagataagt ataactttgt 
114781 attttgagag agatagaaag aaaataagca 
114841 cctctttatt acttttagct gcacctaatt 
114901 gagtgctaac cgtctcagca taagaatctc 
114961 tcttagcctg tcctcctaca agacaaaaca 
115021 tcaacataaa gccaacgggt atatcatgca 
115081 cttcaaagag ggttaagaaa gtacccaaag 
115141 ttctatacgg cacatggggg ttcttatgcg 
115201 acagtgattt atttatatgc tgagcctcct 
115261 cgaccctctc acttccagca agatcgacca 
115321 tttccacgtc gacaccacgt acatgaacag 
115381 tcatttctat aatcctgagg aagcgaggct 
115441 aaaccagtgt gatctactac tcctttcatt 
115501 gagcccaata ttcatcagct caagcacatc 
115561 atctgggaca gctaacccat ttggtaaggc 
115621 aaggaaattt tctatggaag cagcatatgc 
115681 gaatgtacat attccaagta aaatttagca 
115741 attagaacca ccatcagaaa gtatgtcacg 
115801 aacacctact tcatacacaa cagtgttttg 
115861 cagagctctg taattgacac cccagtgttc 
115921 tagcaaaatg gaaatatgtg cgttagagag 
115981 caaagtcaag catggcgttt accattgtat 
116041 caaatataca cacattataa ccatcaagaa 
116101 aaacctcctc tgcattaaaa aaaaacagat 
116161 taacaagcta atctatatac ataagaggaa 
116221 cgaaaacttt attgaactta aacaaccgat 
116281 caaccaccaa ttcaccgttc tcaccaatgt 
116341 ccggaaggaa tggtcttatc cggcaataga 
116401 agaagaacag agtcatatct ttaatggtcg 
116461 tgttttacct ttcaattcct gcacttcatt 
116521 atggtaattt tttgctgcat cagccactcc 
116581 aaatatgaag ccatgtattt cagactagaa 
116641 gacataaatg atacttacca tagtaattta 
116701 agacttcttg ctttatagac actgaagtag 
116761 atgtcattta aaagaactac cagggtattt 
116821 acctgcaaag caccaaactg gttgtctatg 
116881 catttttgag atttagatag gtacaacttc 
116941 gcaacttgta actctgcatc ctttaattta 



agagcacaaa cgatcaaata tcttcgacta 
tcgctttagt tccataatag aaacatcaaa 
gacggcatta agttaacact tggtattttt 
agacatcgta tgctttttag ttacacaatt 
tctcaaatct tcctcccccc cccccgtcct 
ctgttgagtg gtttcattga agttgaagtg 
aatcaacaaa aggttagatg actcacgcag 
atttccattg tgatgcaaac taaagaagac 
tagaatgtaa ctatgcataa cgatttggtt 
gcttatgtta cttgttaagg tcttcgagga 
tctggtggtt tggcctggtg gtttcaagat 
tctgctttta aaaataccaa aacaaaacaa 
attgttctgt taaagtgtgc tagtagaggg 
catctttgct gatttctcga gcttctcagg 
ttcaagaggc ttcacggttt cagggaaaag 
atcagtttct gttcccatcg aaagacagct 
atctgcatcc ccaaggccaa tgagttcaat 
accggcaaac ttagatagct ggtgaagatc 
cggtgaaccg gaatcagaat gcttgctact 
tgctctccca aaagaaccag gtgcctttcc 
gtgtcttcta ggtgatgaag gccccaacaa 
ggttgtactg ttactttttt gcttctgaac 
aacatccttc aagcttgata cctgttacat 
tctctattta cttccaatgg gcaagattaa 
cctgttccat gaggtgtcta acatctcgtc 
ccacaccaga aactctttca gcgaacttca 
catcaggatt gacttgaaca aacataagag 
aaaaagtcaa gacaaagggt taaggaacca 
atacgtgtag cccaaatctt aaaacttttt 
aactttgaag gacttgcgtc aatttgctgt 
ctagagcaaa tatgacatct ccaagggctg 
tgagcctctc tccagttacc tccgagcgat 
agtgcaaact accacgcaaa acagaatcgg 
aaagaacgct gtatacaata ggcaacaatg 
tagataatgt cagaagataa aaaaatttac 
gagagctgtg gcaccaacgg ttctgttcat 
ttcagttgat ctcacagaat gcatgcttgc 
agtattccaa acccctaatg tacgcaagtt 
aaacatatag ataaatatgt aagcatatgt 
acaatattag cttagagaag gatatcttcg 
aacttgctca ttgtatatct caaccatttg 
tctaagctga gttaagtgaa acaagtcatt 
ttttgaagtg atgcttggcc cactctgtgt 
taataatgtt tggctgaaac ataaaaggag 
aagtttttcc agatcccgtc tgaccatacg 
ttgatcgaat taatggtcga gtatctagga 
ttaggagaaa ttaatattac ataaagtcct 
ggaggtttag accttgagtt gctgcttgac 
gggtatcttt cccttgctta aacggatttg 
actctataga agtctgtcca ctgttttgcc 
ctctgatatt tcctggaaaa taaatgcaag 
agtgcaaacc caaagtacac taacccaaag 
gtacagtctt cggttttctt caaggaccac 
tttgagcttt aaaccttgga aaaatattac 
aaactgacca aacagccata caaaattggt 
ggtcctcaaa gtatttcttt tgtgtcctta 
cattcaaagc cttcatgatt cacaaaaact 
attgtttaga atactatatt ttaaagccaa 
aagctttggt aggtggactc tctgttctcc 
tcgagttctt tgaccttcct agttgattcg 
ctctccaact caactgtact gttttgtgct 
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117001 tttgcttcta actccaagca ctggtttgca tgcgcctctt tcaccacctt gagttcttgc 
117061 ttcagcttac gaatctcagc atcactgcgc tctttttcct ttttcaaacg caccacatct 
117121 ttttcttctg acttttctct ttcttctatt ctgctttttt caagctaaag agcacaacaa 
117181 acccattatt gggaaaactc tgagctaaaa taatttaata cacaacacaa gcggaatgaa 
117241 aatcgatgca tgagtcacgt atttaagtaa aaggggcata cacaacgact tatgatataa 
117301 tttcaacatc tagaggattg gcttaaacgt caattttcac aagtactatg caggttaagt 
117361 atgtattaag gagacgatac atattgaatc aataacatac ctttgtacgc tccatacaat 
117421 tcgaaacaat ctacaacaaa gaaaataaca gaaagaccaa tcaataagcg aaatagaaat 
117481 aaactgaaat cagagacaac agtacctcgt tttcctgagt tgccccagct gccaaggttt 
117541 ctaagacctt tattcttgac ctgtatttat cttcacgtac cctgaagagt atattttgct 
117601 gcacatatca taaacatgtt tcattacttg gtcagaagga ttatattaac acgtgcatac 
117661 aaagaaaaaa agagtattat gtactcttac gtttttcagg ttatcagctt gatttgagat 
117721 tctctgttct atcacttgca caagtgcgct caagagtgat gccattgcct gaacaacaaa 
117781 ccgcaaaagg gagaaacatg tggatcataa gaagaaatgt tccatgaact aaggattcaa 
117841 attttaggca aaccagatta aagttatcaa gagaaggaag gcatcagtag ctaaatcttt 
117901 gtcttacatg agacacatgt ccattcatct tcttcacgct ctcatctaga agtttatcca 
117961 gcatgtcaaa tagtgtccga gtaggagcat tctacaaaca aagggataaa gggataaaag 
118021 ctaaacggtt gtacacagtc tactaaaaat aagttagaat actttactcg taaactgtca 
118081 gatttcagta attctgaaat tttagcatct gatgtatcaa tctcaaatcc ttccttcgac 
118141 tggaatccat caataaagtt gcggtcatct cctctggaat ccgaatggtc ttctggcaag 
118201 ctccatctcc tcctcgcacc tagtgagttt ttatcattac caccatcgga aaaacttgct 
118261 ttaagggcct taagggactg aaaaactggc agcatatccc cctgcatttt tactttctat 
118321 taggaggcat aactccataa agaaagacga gagacggcag agagaagaaa atatatgggg 
118381 gcaaaaatca tggaatctta tcaagaatag caaatcagac aatagccatg atgagaacag 
118441 cgttgcagcc agatgatgtc aaaaacaagg cttgttttaa ctgttaacaa aaattatcta 
118501 ccctgaattg gttcggtttt ataaaagcca aggtggtaaa cccacaatct ctttacctat 
118561 gtgatgcaat aatactatat aaaccaacca accagatgtt ctctaatttt caaatcattt 
118621 tttctacaga aacgaaaagt tatatccata gaacaaaaca aacctgttct atgtctgaaa 
118681 cctcgaatct gggcagggcc atttcatcca tagcagtcag aaaccgctca actctaacat 
118741 aagcaggctc aaagctgcct ccctatagaa tagccaaaga gtgtgagtct tagcagaaaa 
118B01 ataaaaaata gagaaaaata aaaacagaag cttagcacca aaaaaagata tataagccaa 
118861 atggtcaaca aaataaaaat tcataagtat accattctca ttgaaccagg actaagctga 
118921 ttcaaaaggt tacacaagac agttccatcc ctcaagcatg ctctcacttc gtcctccgaa 
118981 gcttccgatg gtaaattcaa ataaggaagt gtctgattta accattccac caaattctga 
119041 tgacctgagg gcaacaaaat gaagaagaaa tatacaagaa cggtgagaac agaagccatc 
119101 ttccattcat gtctacacct aaaatgagct taaacttcgc cattttctta taatcacatg 
119161 aataatctct gtcattactc aaagttctga gagataaata caagaaaaag ggaatagatg 
119221 acgacattta cctttcttac taccatccaa gcttgaacga ctttcagacc cgtcgaagct 
119281 gagaaaacta gttacatcgg agggagcata gacgccactc atattggcgt cctgttcgag 
119341 attaatgttc attccaactc agagatcacg ctcagttccc atgattgtag ttgagaggag 
119401 gagaccctaa agcaactaag aaagttttgt caataaaaac aacacaaacg tactaattcg 
119461 ccgtgtaaaa tattgaaaaa ggggctccct tcatcaatct aaccttctaa tataatcaaa 
119521 ctaagacgtt tcaatttcag ataagaccgt ttaaaattat aatcaatgat gatgaaagtt 
119581 aattcccctt gtgttaacca gaatgcggac caaatcaatt acgcagatcc aagccatgaa 
119641 aagagaatta aaggatacga acctgagaaa agctaataga taaagctctg gagggtggaa 
119701 gcaacgcaaa agaaaaaaaa atggcgactg gtactgaagg aaaatgctga gattcacggc 
119761 gaggaagggg agagaaggcg tttgaatctc ctcgccggga gagaaatttc gaatttttcc 
119821 agccgcagcg attttgctca ctcttttcct cctttctttt ctcgtctctt cctttttttt 
119881 tagattccga gaaaaaacaa atgctaatta cggttccaac ttcgaaaata gagaaaaatg 
119941 ttttttcttt tccttagaat taattttaaa aaaaaaacat ttacaataca atttatgttt 
120001 gattttgctt tacgttatag atgttttggt gggccctacc cttttttcag aaaaggctct 
120061 tcccgctttc ctcactccaa aatttttata taaatttttt acataaatat ccgattgatt 
120121 atgcggccca attaagccca cgtataagcc cgtttaactt aaaatgtcct tcaacttgtt 
120181 gatcattcta cctaaatttt catgtctgtt ctttatattt gtttactagg ataatatttt 
120241 tggtgtgtat gatgcttttt tatttgattt atgaggagga actgtagtta gttataaaaa 
120301 tgggttaacc agctctagag tttttctttg ccaaaaaaaa aaagctgaaa attatgaatt 
120361 atgaatctgc ttagcttcga tcaaaccaac gaggcatagt tatgacatgg gctttaaccg 
120421 tgctttatat attcactgat gagaagaagc aaccctcaat atgccaaacg tgtctgttgc 
120481 ggagaggaag tctcagctta tgtgatccat attgataatt tgatatggcc caaacatttg 
120541 attaaaggac atacagaggt accatatata gtatcatttt cttaatactt ttatgttttc 
120601 atgcatttgg gggaagggtg tgataattgt atgaaacaac cttaaccaca gttcacgaaa 
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120661 gagttaaagc ctacgttaga gaaaatatct aagagagacc ttcgttatct tttgttatag 
120721 tactttgtaa catacataaa acatagtaca ctgtacgtat tattctctat atcaaattca 
120781 tcttgcactt gcactttcaa ttgatatcag agagaatcac tgaaatcact ggtaagatct 
120841 tgaagacaaa atagatgatg tagtacagta ggaagcatgg tcattgaaat tgttagtgca 
120901 ccaaaaacag aattttagaa gttaaaataa aaaaattact caaggtgttc aaataccttt 
120961 ttttttgtgc acttttcatt taccagtctt aaaatactac ataaattagt acaagaagta 
121021 tactacagga agcgaagatg tttaattgac ttgaagacta gaattggagc cggcctaaag 
121081 ttgaactaga agttacttag agaccataca atcaaagatt tactttggag aaaaagaaac 
121141 tcatcaatgg aaatttttca aaaggctttg aataagattt ggaaaaacta taaaggagat 
121201 ttgtggaaat ttatcctttg gatagttagc tatggtgagt caagaaatct acggagtata 
121261 attatgtact agatgtttgt ccgcgacttc gcggatgata at at tat at g aaaagtacat 
121321 gttatgttta taacgttata gtgttataat agttgaatag taattagaaa ttttaaattt 
121381 tagaatttaa tgttgacaat ttttttttag aatttaatag tttgataatc aaaaatttca 
121441 gtatatattt ttagaagagc tgtgaagttt tgagatagtt caacaacaaa aatgtgtcac 
121501 aaaaatattt ttcttaattg caagtattgt tacaattgat ttaagtaatt ggttttaaat 
121561 ttccttttag tttattttaa catgtgaaaa aacaaaaaaa aaacatttta gccataaaaa 
121621 tatattttgt gtagttccaa aaatatggtt agaggtctct atagttctga catcttcttt 
121681 gatgattttt tatgtttctc tttgtttgta tgattgtttt caatttattt taaattcagc 
121741 aaaaaataat tgttttatgt gtttctttca aaaaatctgg caaaataaaa gttcggatgt 
121801 aactaaaaac tcagatttat tttataaatg agttaggtag tctttccgtt tcaaaatgtt 
121861 acatgttttg tattttctta ataattgtgt aaaaacctta aacatcactt aaaatgatac 
121921 agagggagta ttaaatttaa gtattaaatg cataactttc agtgattatc gatttttcat 
121981 gattctaacc aataagaaat cagtaaatgt aattaatttt ttttgaaatt ctcaattaat 
122041 cattattaat tgataacaat tgtattgaaa atataaacca atgtatcttt ctgaaacaaa 
122101 atttttttct tagaatatgt atctttatga aatggaggaa atattaaaca gtatgtaaat 
122161 caaaagtcta tagataattg attgtaaaac taatttttat tttaacactc taaaattatt 
122221 aaaaactaaa atatcaacta ttgaaattgt aagatttcac caatgaatgt atatgtaaca 
122281 aaaaatcatt tgggttattt atattttaat atataaataa acagtgtgtt ataattttat 
122341 atgatttaaa tgacttattt atttttaagc ataaaacgat aagtcactta ttatatttta 
122401 atagataaga tataattgtg cgataattta tatgattttt aaatcattat aaatgtacat 
122461 ttatacagtt ataattagat ctagaaacat ttaccatttt gcatctaaac aagatatgct 
122521 caagatttat atgaataatt ttgataaaag tcttaaatga ttcagaaact ctaataaaat 
122581 gattttatag gccaaagtta aaaaaatgtt ttaaagtaat ttcaaaatat ttgtaaatga 
122641 tttataaatc ttgtgtgggt tttaaaaaac aaaaacaatt ataataatgc tctatataca 
122701 aatctaagtt tttatataaa aatatataaa aatattttaa ataattataa caataaatta 
122761 taaagaaaaa aaattgacga aatattatat attctttata taattatgca attagtgacc 
122821 ataaactatt atttgtaata ttacttgtgt tttttagtaa cttgcattaa ttgattctag 
122881 aattttcttt tatttttaga tttgaataaa ataaatattt aataaaagtt atcaaccaat 
122941 caaactataa caatttcttg aaacttcaca tatgattgac acgtcagtaa agtgacttct 
123001 caattaatat atatagggat aagtgtgtct tgtatcgata tcatttaggg atattaaaat 
123061 gagtttcctt aatgggtttt aaatggggtt ttaatataaa taaatgcaca tttacaccca 
123121 tttagtttat gtgatttaaa tgggtagttt tgttaagaca catttattaa atggtgtttc 
123181 acttaaaatt agtggtgatc ctattaaccc atttaactta atacatactc cctctgtttt 
123241 ttaaagatgt atgttttggt gttttcacac atattaagaa aacacattaa ttatacatca 
123301 tttttagaaa ttatcaaatt ccaatgcatt ttaaccaata gtctttcaat aaattcaatc 
123361 aattttattg aaatttgcaa ttttgtatag gaaacataaa aaaatacatc tttgtgaaac 
123421 aatttatttt tctaaaacat ctatctttaa aaaacagagg gagtatttat taattcgatt 
123481 ttgtgttttt ggcggaaaac ttgattttgt gttttgacta gaaacacaca attttacggt 
123541 tatgacatga agtaatgatt ttgcggcaga aaaacgtgat tttgtggttt cggcgggaat 
123601 tttgattttt cggtttttgt gagaaaatta gattttgcgg ttttttgtga gaaaagcttg 
123661 atttcaattt tgacggaaaa cataattttg tggtgggaaa ataaattttt cttattttgt 
123721 cggaaaatgt aattttacgg atttcgcagg aaaatgagag tttactgttt tagtgagaaa 
123781 catgattatg tggttttagc gagaaaaatc aattttaacc atgttctttt tgtaaaatgg 
123841 cggcagcgtc ccgcgtctta aaaatacact tgatcttgaa tgaatcattc ctttcgctgt 
123901 ttattgatta agacgctaac agtacggcca attaaaatta acggggattt gtgacgctgt 
123961 aaaatccagc gtcagaataa atggagatgc ggtggttttt tatgaaacgc agaagcatca 
124021 atcaaacgag gatgccttaa aaagctgttt cctttgtatt ttttttaacg ccgaacgctg 
124081 ccgctgcgtc taaaaaagaa cagagatttg tgtttttagc aagaaaatcc aattttgtgg 
124141 ttttcgtaca aaattttgat tttgaggttt tggctggaaa atctgattgt attgtttttt 
124201 tgtgggaagt ttaattttgc agatttggcg gagaaaagaa attttgtgtt ttggtagaaa 
124261 aacatgatta tgtgcttttg gtgagaaaat ctgattttgt gttttggcag gaaaataatt 
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124321 ttgtggtttt tgaagaaaaa tctaattttg cctaattttg atttaatcgc ttaatttccg 
124381 cgttgatcgc gaaaagttga aacacaatta attattttct actcattatt gacagatttt 
124441 gtgttaatat tcattaccaa gtaattacag ttagctatta aattcaaaac cattgttcaa 
124501 acccaaaaca tatacatact acatttgagg atttctttgt atcaaacaaa cactgatcta 
124561 aatgttttga caaatcaaaa ggatattgtt ttaaattatc tataaaagtt tatataatta 
124621 tgtctaaaaa cttgtgtcat catttttaga attagctaaa catattataa atatttaaat 
124681 tgtatttaaa attagacttc acgatatttt ggtaactcgc taggcacggt tttaccactg 
124741 gacaagcgcc taacgcgatt ccgaacaagg atatatcata ttaacatccc tgataccacc 
124801 ttaatcataa ggattttggg atttaattta agttgacaca aagtataagg gctaagagtt 
124861 tataaaggtt tttgggttgt gtacctcact accttgaaca tgtttagttt aagtgtaaca 
124921 tgtaaaaatg ggaattgaat tctatgaaat ctaatctctc gctgttgctc agtagttcag 
124981 tggtaataga gtgattgagt gttaactaat aattggtcgt agttcaactt aaactgagta 
125041 atattttttc tttgttaaga atactttgat tatttttggt tgaattaatt tgtgacataa 
125101 acaaaacata atatctgtat gtgttatttt atgtatcata ctcatcttac acttttaatt 
125161 ggtattaatt ttacaagttt taatgtacga tttacgtacg tatcttacgt aaatggacat 
125221 gtgcattttt ctattgttca tgttcatgcg agaactatca tccatagatc gtgcaaaaga 
125281 gtggtgggga aaaataactt ctttataaaa ttataacttt taattaatgc gataaaaagg 
125341 caagagaatg tagtttcgaa ttaattacac cacatcatat ccatcaaaat catgaaattt 
125401 agaatttttt agcaccatgc acgtggacag taattttaac ggaaatcaat tagtcaaaca 
125461 atcaaggcac ccagctgttc cttatgaaga aaaacaaaag cttttgatga aactaaaatg 
125521 tcattttaca gaacatatac tatatagtaa tagttgtagc tactactgat tgttgtcatg 
125581 actgtttcag agtgtagaac accatctgat ataaacatgc atgcatgtat atatgttcat 
125641 tcaaatggtg aaatgtattt gcgtacgcat aagcaaaaag acaatgaaac aagtcaaaca 
125701 acacaagact atgggtctaa ctggtgacca cctcgggaat agtaggaaca aaaaggaaaa 
125761 gaattgatgg gaataaaatt atgggaatga tgaggaacgg ttattccata tcaaatttgg 
125821 tgaggaataa atttgttctt taattctcta caataaaagg aatgagaagg aatgaaaagg 
125881 aatgactatt cctaatgaat ggtaaaattt gttaggaaca ttaaggaatg cattattcct 
125941 cttcattcct tggtcaccgg ttagacccta tatgtggatt atgcatatca tatacacgtg 
126001 aaaggtacct ttaactatca aattgatttt gtagactcct tcaaaccaat ctaggtcact 
126061 tcgaactgta gattatccct tacttattaa tacagaagca tttttaggat ataaccttta 
126121 ttttgtattt tattcacaag tttgccatta taggtgatct gtcattacca tagctcttct 
126181 cagctttcgt ttttaataac cctttgctta ctagagaaat tactctatat gccattagta 
126241 cacatatgac ataattggct tgataacata ccaatttaat aagatgggtt tttgttcttt 
126301 tgtgagattt gctcgccgat gatgtcgggt atctcgccgg atatctgagt ttgacgacaa 
126361 gtttagagct ctaaggttgg agagtaaagt aatcaaagtg ggtaatcatc ctttatcaga 
126421 ttatctgata aatcgaaaat gtgagatttg agtggaatgt tttagggatg tttacggaga 
126481 gattcgtgtc ggagatggtg agagagacga ttttttttgt gactttttcg aggatgtcaa 
126541 agagacaaga ggtggtggag acaggggtgg aagagacagt gagatcagtg aggttaacga 
126601 gacgagcggg tagagtcatg agaggccgtt gcggcagcgg aggaatgaag agacggcgga 
126661 gaaggagtgg agtgcgtcag ggaggcgatg tgcgagtagg tgggagcttc gaaggttact 
126721 acggcagatt cgttcggtcc tgtcgttttg ggtggtacct ttgatcaatt gctcaatgga 
126781 catcagactt caaggttaat tgttgttttc agtctgtggt ttctgcttaa gaatcgaact 
126841 aatttgtcga aatatttgtt tgtggtgatg gagaaatcag gcggtggaga tagccatgga 
126901 tcgaatagta gcggggtatg tgatatcccg atgttgatga acaaacaaat aagccatctc 
126961 tccctcttga atctgctctc atctctccct tttcgcctag attagtttag catcctacac 
127021 catttgatat agcaagtcat tcgagaacga ggaatctagc aacttatgtg aggttgaagt 
127081 ctcttttatg gctaaaattg attgtactga atgtttctgt ggtttaaaac tgaaactgtt 
127141 tagctttgtt gatgagctgg aaagttcaag gtctcaaaga cattaaagag ttcaattccc 
127201 tgcttatgtg agattggagt ttaagtccag ttgccttatt agttcttatc atgagttgca 
127261 aagacgtgtt attgtgatct caaggcaaaa ggaagactct ttttttatcg tcttatatag 
127321 tgagtttgta aagctgaaac ttttatgcat tggtttggct tgaattcttg ctgtctgaac 
127381 tcactaatgt cttggcttga attctcagcg agcttgctgt ctgaactcac taatgtcttg 
127441 gcttgaattt tcagcgagct ttcgtctacc ttacttgctc tgtttttttc ttctttcatg 
127501 gcagctctgt ttatttttct tctccttcaa cacgttacca tctaaacatg gtttttaaat 
127561 aaatattttg cattttgtta tcctttcata tatcaaaatt atcaaatcga gttgtcatta 
127621 ctaagatggt ttcttccagc tatacataca catgctatag aacacgactt ctttctcttc 
127681 aagacatgtg ttgattatac tatacagtgt ggttctatat aattcttttt aattaagata 
127741 tcagtacttt acagtttgat tatctttcta gcaagatgtt acgagggtga gatattagca 
IVJtl 1 attct 9 aaaa tatgcatttt cactgttgtt atactcatat gtataatcgc tttatcttgg 
127861 tcacagaata aaaatactta aatatgaact catacatttc attgtcaatc gtgtaatgtt 
127921 gcatataaaa catacgaggt tatattctaa tttcaaaaaa gtacaaaagc aaagtcgtga 
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127981 gtgatcagat cattgtgaat caggagtaaa tcactatcaa aggatcaata gttatataaa 
128041 ccagcagcga ttcacacata cataccattt tcatctcatt gatgctaggc ctaatacatg 
128101 ttactttaaa aaaaaatgga aaatgaggtt tggtatataa tatggccttt tggtaacata 
128161 ccgtcaatac atgcattcgt tagttatgct taaattttac ggaaatatta aattgtgttc 
128221 aaagtgaatt tatgtgaggt gaatataatt aagttgccat tatgtttagg tatattctaa 
128281 aacaaaatat tggtattcat gtgcccagaa aatctcgctc acacaacaaa gagatacact 
128341 tatggtccaa atactataac gtatcaatct aaattaaaaa aaatctcaaa gttgaaaaat 
128401 attctgtagg aatttgataa tgtattccaa tttcttacac attaggtatc catatcgatg 
128461 tactaatata tagaggtgtt cagtctgata gaactaacca attaaaacca aacaaaatcg 
128521 aagtagaaaa aaaagattta gatttgatag ttcaaaagaa gtcgagccga tgttatcttt 
128581 aaaaaataac accgaaaaat tgattaacta aaaaattagt atatttatat gtaatttttg 
128641 taataaaatt aacattaaat aatatataca aatttatttt attaatatgg tattcataat 
128701 caaaacattg attaaatatt tttttatttt tttaagttaa aatggttatt ttatttttca 
128761 ttttcatcaa attaaataaa acatattttt gtttttattg ataacttctt gtgataaaat 
128821 ataattatgt aataatcctt tggtattatt actttttact tttatttgaa tatttaaata 
128881 aaatatatat taatgtttta ataaatctga tttatattca tataatctaa tttttttaaa 
128941 aataaatata tttatggttt cttggtataa aaccgaatga accaaaaatt aatggtatat 
129001 aaacgaacca aaccaaatta tatataattt caatatgata gcgaattttt ataaattgaa 
129061 aaatcaaaaa aaaaaaaaac tgagatagta tttaattctg gaggcctgat taatattata 
129121 aatatttgtc attaattctc tgtgtggtta gtacatacta tatgagcaag ttatgcaatg 
129181 cacctttttc tgtctcattt agtttactgt agtacatttt attgtacata atattgtaga 
129241 tgttatcaat aatgtattct gcaaactata aatgaatatg taatgttttt ttagtaacac 
129301 caatgaatat gtaatgtaac ttatgctatt ttacaatagt ttcattccgc gcatagcgca 
129361 ggtctctacc tagttctatt agtactaagt aaaccgaata caggtgtgga cgtcacatta 
129421 ttacaagcta agcttaattg gttaaagtaa ttaagttcgg ttggttttag gatttatgat 
129481 agaatcatag agagtggctg gtgctgacaa tgttattaat gcttaattaa taagctggac 
129541 caccaaaaat ggactgttcg ataatcaact gaagactctg atccactcaa taaatgcgag 
129601 catctctttc attataaaat aggttttatt tatttttttg gaacaaaaat aggttttatt 
129661 tatcttcaca tgttcttaca tccaatgttg gaatatatca ttttatcttc acttatgtct 
129721 gcgccatgtt atttgtaacg caaaaaaatt gttctcttca ctggttatag ttaacataac 
129781 atgtaacaaa taaaaaacat gtaaaataat tacatattta gctgcctagg aattgggggt 
129841 atagattaaa tattatttat aattaaatcc tttataacaa aagtcaaagt gatccttttt 
129901 ggagtgaaaa ttagaatctc tctgatataa aatactttgt ggatatctca actaaaaaag 
129961 ttagaaagga actgagtatc tcaaaatgag ggatttctca attagatatt caccttagat 
130021 acttttgcaa taggattact ctcttagcta gaatcatgtt tacttgtcct cgcgaggacc 
130081 acaatgtaca aataacacgg atgacatgtt catacaaaat tatgaataat ctgtattagg 
130141 aaattactga aaaggttatt ccttcaattt cacattaatt gtcattctat aatttaaatt 
130201 ttgtttaatt ttagtacttt ttatgtttta atgtatttat ttgttagttt ttctaatttt 
130261 gtctatgtta tttttgttta tcaattaaga taagtcaaat aaatactctt acatttgtga 
130321 ataaatgtca ctttgatatt tttaacacag attaaaaaaa tgacaaaaat aaatataaat 
130381 tattattaat tacacatttc tgatcaataa tatttgagat aaataaaatt atttataaaa 
130441 tcaatgcagt ttacaattaa ttttcaattg aaagtaaata taatttgtat tgaaattgta 
130501 aaatgatatt tttgtgtaac aaaaaaatta gaatgacaat tattatgaaa taaatggaat 
130561 caatgactca ttattatttt ctgtacatct aaactagtgc tatcatgcgc tttaatgtga 
130621 cggaaaagta tggcaattag tacgagaacc tctaagagca ttttcaacag accctcaaat 
130681 cctcaaattt taaggttttt gtttcttcaa cagatcctta aatcttcaaa ttttgaaggc 
130741 ctgaatagtg aaacctcaaa tttgaggttc cactattcag gccctcaaaa cactatttac 
130801 ttttcagttt agttcttata aatttgtaac ttacatatat ttgtactaat attaattttc 
130861 ttattatttt aatccttgta actatatttc acttaaatct ttaaatatat cttatatata 
130921 cagatatata tacatatata tacatatata ttattacata aaatatgaaa atatgttaaa 
130981 actaaaataa aataaataaa caaaaaatat tacttaaaag atatttatta tttcacaaaa 
13i04l catttatatt aaattgaaca atatattaca aattaatttt gtgttataat aaattgtatt 
131101 tataaatatt atgttttata atttcagtgt gtatgtacta tctatctatg aaaaatataa 
131161 taatgatagt tgatttatat taaagttagg gactgaaatg caaattacat aaaattttaa 
131221 aaataaattt gaggatcctt gttggagtaa cccaccttca aaccctcatt ttgaggatca 
131281 ttggccctta aaatgaagtt ccttgttgga gatgctctta gagcaacttt atcggcagca 
131341 aaatcacgta tcttaaaggg ggggaccacg tactttttgc aaaacccatg acagacgttg 
131401 gttaaataag agacgtttgt gctgcgtttg ttaacctgtt tgcggactct actgacacgt 
131461 agtgatccgc gactggctca attattatta ttttttttta aatcagaaaa atgcaaaaaa 
131521 aaaattaaaa ataaataaat taaaaaccta acaaacgtat caccgataat gatgccctaa 
131581 ctgactaggg gtagtatatg gttcagtatg ttaattcaaa cacattaaag atttagaatc 
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131641 aaatgataaa taatactatt tggccgtgta atcatacaag aaataaataa aacagtgaaa 
131701 aaaaccttac tcctttttca aggttcagac acgacaagaa gtagtcctag ttaggggtgc 
131761 ttattttgag agatgcgaag attttattaa aaacatacta tagcttacgg tattaaatag 
131821 ctcttggttt cttgcacagt gttttaaata tctcatttaa atacatattt ttgtcaaaac 
131881 tcaaacatat acaatcacag ttgattctct tcctgactct aattaatttt caccaacatc 
131941 tttgatccaa gcttcaatct ctctcaatat gcaaccttat tgtttgactc gcatggaaca 
132001 gagtctcaaa taactttagg aaagtttgag caagtgttaa cattcgaaat caatgcagaa 
132061 actatatgaa atgcaagctt ttttgtctgt ttcccgggag atccttgatc tataacctta 
132121 aatttcgtgt tcttcgtcgt gtgtagccat tagaatcagt tatgtgtttt tttttctttt 
132181 taagctacca gcattaattt atattttctt gtgtatatac tccgctccta acaattaatt 
132241 gttgtccttt tggcacatcg ttaaaattgg aacgaaacag aaaaaattag cataaccttt 
132301 gcgcaaagat gatacgcaca aataaaaaaa tagtccaaat ttttagttag caaaaaaaac 
132361 aattaaccgt tgttgttggc caaaataagt cttggttgga tttatatgaa tgttatatct 
132421 atattattaa atcagaagta cacatataga atgcccttta gtttttagta ttatttacac 
132481 ttctatgcca ctaacgttaa ataatacttc ctattttaat gctgtctttt ccactttgat 
132541 taatgcattt tctaaaatta aatttgaatt aaatacacaa ttaaataata cttcctattt 
132601 taatgctttg tattttccac ttacattaat gtattttcta aaattaaatt tgaattaaat 
132661 atatttcatt aacttttcaa ttaaatcaat ttcaaattaa aaaaatacat ttttgttgga 
132721 caaacaattc atggaaatta taatatcaac tcttcattta acaaatctga acgaaaagaa 
132781 tattatcaaa tttgaaccga aactagatac tatccaaacg gatttaccat tttggtatct 
132841 agagaaccat aaccaaacct tatctgaacg aaatatttca gagattcgaa tgtatttaaa 
132901 tcatatttat atacttcaat atgttagcta ttttcgagtt aatatccaat atataagtta 
132961 ttttaagttg tttaaaatat ttgaaatata aaaaatagtc gaaagtaaac atctaaagta 
133021 gataaacaat aatcaaaaca ccaaaatact taaaatatat atatttatta ttcatccaaa 
133081 tattcaagtt aaacctattt taaattttta atttaggtat tttgacttac attactcaaa 
133141 tttacatgtt atatttttta agatttaggg atatttaaag atatataaat ttaaaaaatt 
133201 taaaaataat ttaaacgggt tatcaaaccc gcaaagatcc gaataaaacc ggaaccaaag 
133261 tttataaata cctgaataga gctagaatct ttaaactcaa aatttcaaac ccgaatagat 
133321 tttaaccgaa ttcgagtgga tatgcaaata tgcatcccta gcttagtcaa tataaaacga 
133381 tcaaatatta caaatatact atttagtata aataaataaa aactgaaaat taatacacgt 
133441 gcggtcgcac gggtcaagat ctagtttatg aattatgata ctaactaata gattggtcaa 
133501 tgccctttat atattatttt aaatctgatt atatattttg aaataggtta ttaatacgtt 
133561 cttaagaatt atggttttat actactttaa cttgaactga attccattct acaacacata 
133621 ataatatact atatttaaac cattcacaga tcgtggtcaa taaaaccata tataattgat 
133681 caactaatat attgtggtca gatctaactc gtataccctg ttaaaaaaat gatcacatat 
133741 gttttgattt catgttaagt ttcatccata aaccaatagt gggtaagcat gtgtgttgat 
133801 tatagatagt gtaagtttcc ttagtttact taatttcgaa tgtataactt ccactcaaaa 
133861 gtatagtaca aagagaaaga acaacttagt ttttcttttg acaaataaaa tacaaataga 
133921 gagaggtgtt tggaacacaa tattcaacct aaaggcaagt acccacttct tttgtttctt 
133981 aaagattata ttgaacatga accaagttaa gatatctcca atgataactt catttttttc 
134041 ttcgaaattt gaagtcattc aaaaaatgaa gcaacaagtt tttcaataat ttgcttcatt 
134101 ttcttcaaaa aaatatattc tatatattct ctatcactat tttatattta attattagta 
134161 acactttata cttttcaaat ttactagttt tactcattta ttttgtttta acaataatcc 
134221 gactctaata gtaagttaaa cattattata tgatataatt attacacatg acataaataa 
134281 gtaaacatac acttttagat tttataaaga aacaccaaat acaaatatct taaatgatta 
134341 atcagaatat tataaaagcc aaaattgagt ttattttatt ctaatttctt ctaaatcaaa 
134401 catagaattc ttaaaattta ataatatcat gtctttcgtt attaacatct aattagatga 
134461 aagcaataat ataaattttg aaattaaatg ggtggcattt aaacatacaa aataacaata 
134521 aattaaaatt atagatcagt ttgtaaacaa aaaaatatat tctaaaaatg aacaattgaa 
134581 cattattttt tcaattgaag aaatactgtt caattctcaa aattttaaga aataaagaaa 
134641 aaaataaagt accactacag taaaaaataa agcaaaaata taaaatgaag tactattgaa 
134701 gatgatctta cacaacatat gatataaagt tggcgagtta gctaattccc aaagattaaa 
134761 gccaccgaaa aactacattc cgactatttt ttttgttacc actcctttaa aataaaacta 
134821 cttttaaaaa ttcctcgtat atttatgtat tttatgataa tataaagtgt ttgtttacct 
134881 aagaatgtat ctaattcacc tctctattta catgtataat agcatttaga gataaaatat 
134941 tattaaacac attcttattt cttttacgaa attataaatt gttatttttc tttatttatg 
135001 gagaaagtaa catttctcta tatttagctt ttattttgaa attgataatt tagagaaata 
135061 tatcttttta taaaaaaaaa tctatgttat aaaaagaaag taaactatat tggagaatgg 
135121 agatggtcta agtgtcataa agaaattctg agaccatcct ttcttttttt ttttgatcaa 
135181 accagacttg catttatctt aaaagagttt aaactccatt agaggaggat acaacaaggc 
135241 tacaaggagc caaccaaagt gataaactga taacacaaaa gataaaattg atacaaatca 
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135301 caaaaagaaa ccataaaaat cgcatgacaa 
135361 caagtgtaac ccaaacggtc acaaacactg 
135421 aggaagatcc tttgagtacc agcaacgctt 
135481 atattgggaa gagaaacaac tccaaacagg 
135541 tcagattcaa caggcaagat atgtcaacga 
135601 aacacaagat aacagcacca aaaaccaggc 
135661 taagaaatcg cactgaaacg acatagatgc 
135721 aagctctgtc catcagaaca ttaccaatta 
135781 atacaactcg cattccaacc cattcaactg 
135841 gatgaacctg gaaggctgta accggagggt 
135901 cagaccattc cacgagccga caaggaaacg 
135961 agatctttgt cgccattgtt gaacaatgaa 
136021 gaagagaagg cttagtggag gtgacgcgga 
136081 ggtctggagg taacggcgga ggagcgacgg 
136141 acggcagcgt atctgcagga ggtctgtaaa 
136201 tggaggagga gcagagctgc ttcaaaggga 
136261 gagatccatc caaaacggca tacctcggga 
136321 ctgagaaaag ttagcagttc tttgagacca 
136381 caatagtaac ttcaaaatac aaaatactat 
136441 aataaaaaac actatttaac tatttttatt 
136501 aatgttaatg ggctttgttg tcttctccct 
136561 tcaccaactc ataagtcaaa ccctataatt 
136621 atggctaccg tagaaaacag aatgatgccg 
136681 gttcgaaaca ttcaatggag ttacggaatc 
136741 tacatctcct aatctttacc ctttcctctg 
136801 atattgagac attgttgttt ctaatcaaga 
136861 ctttcattct ttctcatttc aaactttatc 
136921 tgttggagtg gggagatgga tactacaatg 
136981 catcggaagt caaagctgac cagttgggtc 
137041 acgaatctct ctccctagcg gagtcttcaa 
137101 cttccgcctc ctctttgtct ccggaagatc 
137161 gcatgtcttt cgtcttcaac attggtgaag 
137221 tcaatcaaac cattttttta aaaaaaacga 
137281 attctaattt ttgattaaaa aaattatgta 
137341 atgtatacta ttacaaaatt tcaaagatca 
137401 tagtcaaact tacttataag agtactttga 
137461 tattatcatg tctccttcag ttctcatctt 
137521 ttgattgtta aaaacgcaat gtttttacgt 
137581 tgagttataa atgatgatat atagacgcat 
137641 atactattaa agtacaagca catttggatt 
137701 ttcttttgta ttcattaatg atattttgga 
137761 tttggtttgt tattaaccac cggtttccta 
137821 ttgatactgc atcattttaa tatttttcca 
137881 tctcaaacta tttaatggtt tggttttaaa 
137941 ctaacaaaaa ttatttataa aaaataaaaa 
138001 ggtaaccgga tttcgatttt aatagttttt 
138061 cccgaactga atttatcttt gatcaaccgg 
138121 tttcaaaaca ccgatcaaaa ctataaaact 
138181 ttcaaaatta aaaacctaaa aatacatgta 
138241 tataaaatac atatatgaga cgaattatat 
138301 gtgattatat aaaattttca cctaacataa 
138361 tagtaaaatg attaaaaaac caaactaatt 
138421 ggaggagtgt tggcgaacgg gcaaccaata 
138481 aaagtcttca ctcgctctct tcttgctaaa 
138541 tggaaatgct aacgctataa ctaaaacgtt 
138601 agtgcttcgc ttttgacagt ggtttgcttc 
138661 acgaccgaac atgtaagcca atatttttat 
138721 cataggagtt caaaccaatc aacatgcaga 
138781 tttttgttca aatatcctgt taattttttc 
138841 gtatttctag caagtatatt aaacaatata 
138901 acataaaaaa gtcccatctg acaaaaggtt 
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cgatttcata agcttcttta ataatcgtac 
catcaacagc agaggggagc attagctggg 
tatcttccat ttcactccat acaagaggaa 
agacaattgg gagaccaata cccgtcaaga 
gttttccttc cagctgcaac aaagaagaga 
ttgacacaat aaaggaactg catactgcag 
cactaaaagg cacaaaggta gaaccgaagg 
aaaccaatga cctcaagttc aaagtcgccc 
aataaaggta agcggagaga gaggagctat 
gggaaggaag gtcgggacaa gatctgagaa 
acctgcacac aagagggacg ccaccaccac 
gctttaaatc aagggtgtgg gtaagagctt 
tgcgaagaag agaaaggaga gcgtctggag 
gagagagaac ttcaaccgga ggatactcgt 
ccaacggagc agcattcacc atcagggcca 
gtctgaagga gcgacggagc tgaagcggtg 
cgcgggcctg agagaaataa ttagggcgaa 
tcctttctct ctcgatcaat tcaaccacaa 
ttaacttttc tatttttaaa tttttttatc 
attctacccg tatatggaag gttcaagata 
tcccgaaact gaaccagtgt catgtaaaaa 
ccgttctgct tttttccttg tctacaattg 
gaaaatctaa agaagcacct cgctgtttca 
ttttggtctg tctctgcttc tcaaccaggg 
aagtttacag ttttcttggc tttctccacg 
actgacacaa aatatctggg cacatttcaa 
tgtatgttgt gggttaaaca taaaaaagac 
gagacattaa gactaggaag acggttcaag 
ttgagagaag tgagcagctt agagagcttt 
cctcctgtgg ttctcaggtc aatagacggg 
tcaccgacac tgagtggtat tacttagtat 
ggtaaatctg tctttctcct aatcccatca 
cgtgacccta acttttatta tgtactcttt 
ctatatttct ccttagaaaa ctgatagaaa 
atgtattttt gtatagatat gaattaagat 
ttttgtctaa caaaaaagac aaacatcctt 
ccacgatagc ttattccgac atggacgagt 
ttgcgtgttt acaaagtatc aaccttaata 
ttttgtcgtc cggttaagca taaaacatct 
tttactcagt ttacccctat taaagaagct 
cattctgtat tggtttcttt tttaattgtt 
attcaatttt ggtatgtgat tattccgtgt 
accggacatg tttgaaactg catcgttttt 
ctgctttttc ctaaatataa tcccaactat 
ttgtttattg gttcaaccag tggttcaacc 
attggatttt ttaatttttt ttttttcaaa 
taatccgttc aaccgcaggt ctaagtcgaa 
gttatattgg tttatattta aaatatctaa 
taatatagtt ttatcaaaca taaatctgga 
aaacacatat acaaaacgaa ttatataaat 
acccgcgctt tgaaagcgcg gatcaaaatc 
tttccgtctt tttattattg tagagttccc 
tggttatgta acgctcatac cgccgatagt 
gtaagttttt gggtagtttc tatttgcaac 
ttttcttgtg gccgcttttg ctcttttaag 
ccatttcttg gaggagttct tgagatcggc 
atttatccta taaactcagc cgtttcgacg 
tacatattat cctgttaatt ttttcgtttt 
atctatcaaa aattagcaca aaaggtaaaa 
atttacattg atttgataaa tagaaatagt 
taaattctcc gacatttcta atcaatattg 
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tcataaagct 

ttattatttt 

aagacattgt 

attttcgatc 

acaacttcca 

ggtggtgcat 

caccaaccgc 

gttacttgcg 

aaccgagtaa 

acaacgcatc 

acacggtgga 

ttaaagaaga 

acaccagaag 

aggaaacgcc 

attagtaagg 

gttttgaatt 

aggaaagtaa 

tacgatcgag 

atctgacggt 

aagagtatcg 

tgaagaagat 

ttcgtttggc 

tgagataatg 

cggggatggg 

atttctgtta 

tgcagcataa 

ttgcatggat 

ggtttaattt 

gaaaatcttg 

tagtcgtaga 

gaaatgaaac 

aaagaaaata 

ccaaacaaat 

ttggtatatt 

gttggtcaaa 

aagtaatgtc 

atattttaac 

actgtatttg 

tgttaacttt 

ttgatttaga 

atggtctcta 

aaaacacaat 

gttaaccaac 

gtttgtatac 

tagcatgcat 

gggacaccat 

gagtatctgg 

ccacattgtg 

agttataata 

gtcatttgtg 

tgactcgagc 

agtgggattt 

tatatttaag 

atcgttttgt 

tctacttgaa 

cttttttggt 

actaaataca 

tatttaggac 

gcaaatattt 

aaatttgagc 

aaacccaaaa 



gttatatttc 

ttgttaaatt 

tccttgaagc 

ctttaagcca 

caagcgtgta 

cccaggtaca 

ttaattctag 

gtccaaggaa 

atatggacga 

agctagttct 

ggaggtcatc 

ttatatccga 

atagcgggtt 

gcgagaaatt 

taacctaatc 

tactacacgc 

tcttatatct 

tatcttcaag 

aaagaaatgc 

gcgaattgtt 

gaaccggctg 

aatgaggtgg 

gatgtcatta 

ttattgtgtt 

cctaaccgga 

agggacaaaa 

atgttaaagg 

tctaatcttt 

gcagatcctt 

ctaatatgag 

actcggctta 

gaagtaaaat 

catatactaa 

taggaaagta 

agctgctagg 

atttcaaata 

aattacttta 

gatttcgtat 

actgcaaaga 

acaagaataa 

acatgatcca 

attaatttgt 

aacatgtgtt 

atcaattaat 

aagttaagac 

aataactata 

aacatcctag 

caaacctgca 

acatacaaaa 

ttacactttc 

caattgttag 

tggtaggttc 

tacttgaact 

gtcacaaaac 

tccggatctt 

ttggggttcg 

tgtacagagc 

ccgaaacata 

aaatatctga 

aaaatatcca 

gttgaaacaa 



cagcgcttaa 

attttaggtt 

tcctcatgga 

cgataaatac 

tgagcaagaa 

aagctggcag 

cgattgcgtt 

gagtaggaat 

cgatgttcat 

tggaccgcag 

attgtctgca 

ggttcctcgg 

taaggttggg 

gaatgatcgg 

caacactttg 

gttttgcatg 

tgtggattct 

aacttcaaag 

gaatggctat 

tgaaaagcaa 

ataccggtta 

ttattgagct 

gtgatctcaa 

taactgtcaa 

atttgagtaa 

atagccacaa 

tctcacaatg 

ggtactcaga 

tgaaattctg 

ttccttagtt 

gttagtacag 

tagtagcaaa 

aatctaaata 

atgttttaaa 

aaagtgtttt 

attgcagcta 

tcataaaaag 

ggtagcatcc 

atacagaacg 

taaaggagaa 

tcagttggtg 

tcgagaagat 

gatccaacta 

atgtataaat 

tatatagcat 

attcaaggct 

ctaactacta 

tatataatga 

tccaaccttc 

gataacataa 

catatgaaac 

tacttgaatc 

ttttcggttt 

taaatacatg 

ttgcgtcaga 

ggtaagtttc 

cggtaaattt 

ttcatagact 

aaatacagta 

aaatatattt 

aaaaattgta 



tagtgaataa 

gcagagaact 

actttatcaa 

attccagtgt 

ccagatgatc 

tttgtgggtg 

tcccagactt 

caacggttag 

taccaaggcg 

tttcagaact 

aaaacgttgg 

atgcaccaaa 

gatgaaaccg 

ttcataacgt 

cttcaatagc 

tgcatgctaa 

tcagaccgat 

acgggttcaa 

gaaaaggaag 

gaggaaggag 

tgctggtcta 

taggtgtgct 

tttggactct 

ttgcaaggta 

accgaaactg 

caggagtgat 

ttttagattg 

tatttaaacc 

gtaaaatata 

gatatataat 

tatctattct 

tgcctatgtt 

gatagaagat 

gtatagtttg 

gttcacaaaa 

acaccactga 

aaggaagaaa 

aaaataaagt 

aggtgttaga 

attctttaat 

aagttgacgc 

tatggaccta 

gtgtctataa 

tcgactgtga 

gtgtctctaa 

gatttctaaa 

tatgtagtag 

aaaaaatctg 

atttcagagc 

agagaaaact 

tagtattaga 

cggatctttt 

ggggttcggg 

tacagaacct 
tatttggatc 
aaccgggtca 
atgttatatg 
tgaaaaatat 
attttatgtg 
ctaaaatttc 
atgccgaatg 



tgaaagttaa 
taaacgtgat 
cgagatccga 
ttggaactga 
atgattcgtt 
aagaactcag 
ttgttggaac 
atcagattca 
tgatctcgac 
ttgataagcg 
gagagaagtc 
agaaggcgtt 
cgaaccacgc 
tgagatcaat 
gttttgaatt 
aatattacat 
aaggtgtcga 
gaattggaat 
aaaatggagg 
agtgagagtg 
acagataatt 
tggagagaag 
cactcggtac 
cagctactaa 
atatatattt 
tcaaaatgca 
acagaattag 
ggattttttt 
atttcttttg 
gaaggtaaat 
tgatgatagt 
gtgactagtt 
agttcgattt 
aaaatataaa 
tacaatgcct 
gaaatttgaa 
atcttagcac 
ctatcttttg 
aaattggttt 
gagattggtt 
ttcgtagatg 
tgcatagaac 
aggtggctat 
catgcataac 
tgtacgatcc 
gctagaaaga 
ttctactaca 
actacatcat 
cgttgcaatt 
gatcttgagc 
catggacatt 
gcgtcagata 
taagtttcaa 
tgagtgggat 
aatatattta 
aagtcgtttt 
ttggattcga 
taaaacctaa 
aaacccgatc 
aaaattatac 
tatacccatg 



aagggctaat 

acaatgcgtg 

ttatcaagaa 

agcttttccg 

catcaacggt 

taattgcgtt 

aaccgggaga 

agaacagaat 

gattttcaaa 

gtctagtttc 

gcaaaatatg 

gttaccagac 

cttgtccgag 

gattccttca 

tacgacaagc 

tatattttca 

ttcttgatga 

cttgcagaga 

atgaagatga 

atgtgaatgt 

taaggatcgg 

ggatattgct 

agtcttcaac 

cacaagctaa 

taatgctgat 

cttcaaagag 

cttttattcc 

ttgagatttt 

gttcataacg 

cagatgatgc 

acggtttgca 

aaagattcta 

caatattctt 

gataagattg 

tagtaaaaag 

catgcgttac 

aaggagttgg 

agtaagaaga 

gagacctttc 

gacatggtag 

gttgtttagt 

cttttttttg 

ataccaacta 

gatccagcta 

agctatttgt 

ctatggaccc 

acatatatga 

gcataaatcc 

acagtaatca 

caaagctggc 

tggatccttg 

tttggatcaa 

ccgggtcaaa 

tttggtaggt 

agtacttgaa 

gtgtcacaaa 

tcggttcaga 

aagaatatcc 

cgaacacatg 

tcgaaaaccc 

atacatgaaa 
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tgtttgatgt 

aagacccaaa 

tgaatctgaa 

gggtcaactg 

agatcaaggg 

ataagagggc 

cttcgcagtg 

ttacaaaagc 

acctaaatag 

aaacttgtcg 

cccacgaatc 

cgtcagacgt 

ccaccacttc 

gtcgttcttt 

gcttggtgtc 

gcttatggac 

tgtctggtgc 

aacggtgaga 

agcttagtgt 

gaacggttta 

agagaagtac 

cgatcggaga 

gcgggagcga 

gagagaggga 

gcgagcgagg 

gagcgagaga 

gagagggagc 

nnnimnnnnii 

gagcgaggga 

gagagcgagg 

gagagcgaag 

gggagagaga 

ccatgaacac 

taattatgaa 

tgaatagtcc 

atgagatatc 

ttcagcttca 

gacctgatcc 

cgtctcatgt 

ctgagacggc 

gtctgtgtag 

gtctagtagt 

agcttggaga 

taacagtcca 

taaaagatga 

tttcatctaa 

gtctttttat 

tgcagagaga 

atcttcctct 

tacattgtag 

caacttgaag 

taagctactc 

cctttccact 

agactcaaca 

actgctagaa 

cttcttggtt 

aggcttcaag 

ttccttcaca 

ttgtaagagt 

ctgcttggac 

ttcattcaca 



atacaatttt 
atgagacctg 
ccaaacatag 
gtcgacacgg 
tttgatcgag 
gcgaatactc 
cttgtgaaag 
catagtagag 
aaaaggaaaa 
tttaagttat 
tggatcatca 
acgaaggtgt 
gactggacca 
agttaagtac 
tgctgcaagt 
gtcttgaagg 
agtcgaaaga 
aacctgtact 
cccacgactt 
caacctccgt 
ctactaattt 
caataaatgc 
gggagcggga 
gggaacggga 
gagagatcga 
gcgacagaga 
gggagcgggg 
nnnnnnnnnn 
ccgggagcgg 
gagggagcga 
gagcgaaaga 
gagatcttat 
aaaatatctt 
gattatattc 
atagagatac 
tcaaccacca 
ccaccacaat 
aatgcttttg 
atcagatctt 
ttgaaaggcc 
tcaagaacaa 
gatgtttctt 
agttgcttca 
atctgctctt 
aatcattgat 
catatgaatc 
actcgactcc 
catctacggt 
agtccgtctt 
ctgtgcggat 
ctctctcttc 
agtgctttag 
gtgaacctcc 
cgtttcttgc 
acatcgccag 
agatgtttga 
agaacgatct 
tgaggtctct 
gtaatgaaca 
cggtcttttc 
caaacttgaa 



ttggatattt 
cagatgaaaa 
gttggaatga 
ttcatgcaaa 
aagccaataa 
ttcgcagtgt 
ctcttaagaa 
ttctttatat 
acataaacct 
gatgctgctg 
ttgtcgcctt 
gaaggcaacc 
agctttttag 
acccaaacga 
ttgtatttag 
ttggtaataa 
tccaacggac 
gcgattagtg 
tatgttatct 
ttgaccatca 
ccataacgaa 
gggagcgaga 
gcgagggagc 
gcgagggagc 
gggagagagc 
gagggcggga 
gagcgggagc 

gagagaggga 
gagcgagaga 
gagagcgagg 
aacaacctga 
gttttcaaca 
tcaaactgaa 
cagattaatc 
acccttgcat 
ccaaccaaag 
gagagtcaca 
cttctctaaa 
aaaacttgat 
gaagaagctg 
cgttgcatct 
caaactcgtg 
ctgcgcatga 
cgatcagaga 
ctctctagta 
tgcaacgatt 
ttcttaaaga 
ctacaggagc 
ttggatcgaa 
tcttctccag 
gaacttcata 
ttggcgaaac 
gcctaaacac 
aagcctcggt 
atctcaagta 
catccaaatc 
ccaagtcttg 
actctttgtt 
ctttctcctc 
ggttaatctc 



cggatatccg 
aaatacccaa 
gtttgattta 
ctgatagaga 
cgtatttttg 
tcgttaaaac 
aatcaatatc 
agaactctta 
taagtaaaaa 
catcaggtcc 
tgaacggaga 
tgacacgata 
acttcaattt 
gatcacctgg 
tcgtagccga 
agtcaatggc 
cacgaggaac 
gcatgattgt 
cccacaagac 
gtctgaggat 
cgccgaaagt 
gcgagagcga 
gagggaggga 
gagggcgaaa 
gagcgagaga 
gagagaaaga 

gggggagcgg 

nnnnnnnnnn 
gggagcaagg 
gcgagagaga 
gagcgagaga 
tcaagagtaa 
acagctgaaa 
attattttat 
catacaccta 
aatattttca 
acatgatggc 
gcagagtaaa 
ggaagctggt 
gtcttgacgg 
gtcgttgttg 
cgccataaga 
aacagagtct 
aaagtaaaga 
taagagacta 
cagagacagg 
tagagaaaga 
acgtacagct 
tctcaggctt 
aacctccaag 
cctctctctt 
aaccggtgaa 
atccttctca 
tggtgactcc 
gtctttgtta 
ctttctgtcg 
atcagccagt 
accttttttg 
cgagcttaga 
tgccttggaa 
tgagctcttc 



attgaatctc 
catgtatttt 
ggctggattg 
gcggaagcta 
aagataactt 
tctttagggt 
tttattaatt 
aaccgtcata 
ggaaataact 
tcccggttga 
caaatgcttg 
cacattcgga 
attgtaatca 
ttcgaagatc 
ttccaaatga 
ctctccatga 
aataccatac 
gtgcaaactc 
aacgaagtaa 
ggtaagccaa 
gactgagaaa 
gggcaggagc 
gggaacggga 
gagcgagcga 
gcgagatagc 
gggagagaga 
gagcggggga 
nnnnnnnnnn 
gagggagcga 
gagagcgagg 
gagagaggga 
taatagctgg 
gactattgaa 
atacatagca 
atctgcattg 
acgacatcag 
tttatcagat 
ctccagtcaa 
acattgacga 
taaatctcgc 
tgattaccgt 
ttggtccagt 
ttctcgacaa 
gatagtgaga 
actgtttctg 
actcggttgc 
gattaaaaca 
gatgagctct 
gttttctcac 
tcctccttgt 
ctctctttga 
gaatctgatc 
atgcttagct 
gggcttctat 
gatccaacaa 
tcaaccaagc 
gaaggagatt 
gctgagaacg 
atatctaatg 
ctgatcaatg 
tttttacaat 



gagtagtgtc 
attatatacc 
tacagtatgt 
tgtaacttca 
caaaaacggg 
tgcgaatact 
caagtctgtc 
gaaattttaa 
agaaagaagg 
gaaggatttg 
atgttgaaga 
ttgatgcgtt 
cgcaagggga 
agctcacaac 
gactgcgcga 
aggcgagttt 
acaacacaaa 
cgcctgtcca 
attacccaaa 
gctcatgtca 
ccgtgtgtct 
aagggaggga 
gcgagggagc 
gggagcgagg 
gagagcgaga 
gagagaggga 
gcgggagcgg 
nnnnnnnnnn 
gggagggagc 
gaacgagaga 
gacagagaga 
agagagaccg 
cacagaaaac 
aaaaaaatgt 
ttcttagctc 
agactacacc 
cagcctcaac 
aacgtctcaa 
ctctcgagct 
ggagaatttc 
gtgagaattc 
tcagtcttga 
gatcaaaaga 
cttaagtgat 
aggccaactg 
tcttgttcct 
tcaaatccgt 
ctagatatct 
tgagagacct 
ccaaggacca 
gttttgtctg 
tctcctgttg 
tgaagacatc 
agccaactgc 
caagcttcaa 
ttggtagtct 
tatcacgccg 
agttaggatc 
cttccatgaa 
tttcagcagc 
ctctcccatt 
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146281 cttcagatcc atttgcttag ccttctcaac ttcactcaaa gaagtttcct cctttgcatt 
146341 cttgatcctc tttatctctc tctctagtaa accctgaaca ggttttgatt catagctctc 
146401 gtcttctctt tcagaactaa aggatcttct cctcgtttta ggcttacatg agttctctcc 
146461 ttccacaatc cgctgaacat atcaagagat tcagttttct gcttccaaaa aaagccaact 
146521 caaaaaaact aaacaatttt gtaaaaactt acatcatgac actgacatct ctcatgaagt 
146581 ctcttgagca aatcaagttg gcttttagta taaacattac ctgcaaacaa acaagaagca 
146641 aaattttaaa actaagatca tgaagtgatc caagaaacac acttaacagt actactaacc 
146701 agcagagtct ctgatgggtt tcttctgtga actcagtctc ttattaccag atttaacatg 
146761 tctgaaatcg aagaagttaa tcagtttcca accacaacca gacttgtaac tctcgtagac 
146821 aagagggctt ttagggaaaa ttggagttcc aggactcttg ctgcttcttg gtgatctcct 
146881 tttctccatt gattatcaaa taactcaaat cattcaaata tcatgtctac ttcaatctag 
146941 taaaaaaaaa aacgattccc aataacctga aacacacaca gaaaccctga tcttttagac 
147001 aaaatatgaa aaataccaac ataaacaaaa aaaaaacaga ggattcttga agttgtgtcc 
147061 gaaaattaca tcttcaggat acaaactgtc aattttactt tctgggtttc atttttattc 
147121 cctactctaa accctaaact gagatttcag tttcttcact aaaaaagcct tcctttagtg 
147181 taataatgta atatgtacat gtatatatgc gcttgtacag acttatctat cagagatttg 
147241 cttaccacta agctttgagg ttgaaatggg acagaaaaaa atgcaaactt tttctttata 
147301 cagtcagtgg agccaatcaa gcactgagct ttgtcatgga ctgatgaact ttttttaaga 
147361 gttaaaaaaa ccgcaaattt tgtttccaca tacaaaagaa aaagggctaa aacatgattg 
147421 aggttgaaga tcttttgcga aaattaaagg gacccttgtc tcttcctcta cccttctaat 
147481 taaaaaaaag gatatttttc tgtattcaag aaaaattgta aggacaataa gtgaggatta 
147541 ctgtttattc ttggagatgg gtcctctcac cattaaagag gcttttagtc tcgtcctgga 
147601 gaaggaggga gaggaggagg aagatgaagt aagactcttt ataaaataag atatatgatt 
147661 ggaagagaaa tgtaaataat caaaagacca attgaaaata gatctttatt agaccatctg 
147721 acatttttat tttgttttct ttttaacagg caagcagaca ttttcttgac agtgggattt 
147781 ttatttattt ctagtaaaaa tttacaacta cgtttttaat tagcaaaata aataagcgaa 
147841 aatgaagggc aggagaggga cgtgagaatg aaagagactg taattttgca gatcgtttca 
147901 catataagac aatgctttca aataaagctt ttatttctgt catcttctta tctacaaatg 
147961 catgggggaa taggatattc atgcatcaat ttagctaaaa ttagataatt agtatttgaa 
148021 ctcagaaatc attcaaacct aataatttag ctcgattatt tctaaactta gatattcaaa 
148081 tttcaaaatt atgatatata tcttgatgaa gttaaataat gagttggatg ctaaactaat 
148141 tacaacatca tgcaatatat ataactgagg caatagagat cttatatctg atgtgttagt 
148201 tgaactagca tgtcattcat cattgaactt tgtcaacaga gatattaacc ttagtaaatt 
148261 ttgaatatgt aattaaaaaa aattgaattg acgtgtatgt ttgcacataa tttagcctta 
148321 agaattaatt gttgtatttg aaataagttg agatatgaat aaatcattct ccctataaat 
148381 gtgttttgtg tgtggacaag acttggattg aagacttgga ttcaccctct atggtaaact 
148441 tttaaatttg ccatctcact aataaccaat caaagtgtca actagattaa taattaaaaa 
148501 atattaaata actaaccaaa atagttagaa aaataaatag tgtcaatttt aataatgcta 
148561 atgtctctat ccaatccata aactcaaact ctaaacccta aattctaaat tataaatcca 
148621 aattctatat cataaatcca aatattaaac tctaaaccca aattttaaat cttaaaccca 
148681 aaccataaat cataaaccca aacctataac ctaaacccaa acctataccc taaacccaaa 
148741 acataaatac aaaacccaaa ccctatactc taaatatttg gattaatatt gatgttgttc 
148801 ttttgaaatt taaggttaag aattaaagtt tgggttacgg tctatagggt ttgggttacg 
148861 gtctagggtt tggatttagg gtctacggtt tgggtttaga gtttatggtt tgagtttaga 
148921 gtctaggatt tgggtttagg gtctagtgtt tgagtttagg gtatagtttt gggtttagat 
148981 ttagattttt tggtttataa tttatggttt gggtttatgg tttaaaattt gagtttaggg 
149041 tatagagttt gagtttataa tttaaaattt agagtttgga atttggattt aggtcataga 
149101 gtttgagttt aaggattaaa ttgtgatatt agcattatta aatatgacat tatttatttt 
149161 ttaattaaca aatattaatt tagttggcat tttgattggt cattaggaaa gtggtgaatt 
149221 taaagattca acataggggg tgaacccaat ttttatcttt tatgtgtgtg aatgtttgat 
149281 gaaacatata cattacagag ttgaaaaaaa agaaacatat acattacaga ttacagtagg 
149341 tttatgtggt taaaacattg atttatcact gtgcgtaaaa ggtctgtcac aaaagtgtac 
149401 cttttctttt tggctcaaac ttttcctttc tttttatcgg aaatataacc gttagaatag 
149461 aactcaagta aatgaaataa gtagtacatc tgcttttaat gtttgttatt attttagtcc 
149521 taggcctcgg catttcggta tcggttcgat ' tcagatattt cgaattttag gtagttcgga 
149581 tagaaggtta gaggattcac tgagtactta acttcttttc gattcggttt agttcggata 
149641 gtttggggtt cggttcgata gtaaaactag aaaccaaaaa gtacccgaaa taattttggt 
149701 tctcatttgg ttccagttcg gattcggata gctcggataa atattagtta tttaggataa 
149761 aatatcaaat aattatgatg atttaaataa aatgttagaa tattttggat tattttggat 
149821 atttcggata aaattatccg gatacttttg gttaatttgg atatttcgga cattttataa 
149881 taatgtagtt atcttcaaat atttttgaat tcttttaaga aattttttaa ttataaatat 
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atatatatat 

gtatccgctc 

accatacgga 

ttcggttccg 

agttccaagt 

gtaacgtaaa 

ttaatacgtg 

atttatagac 

gttctttctt 

atcttacaag 

aacgaggaaa 

gtacttgttt 

tttatttagt 

ctacatgcgt 

tatacatata 

gtagttgaga 

taattatcgt 

tatcggagtt 

tcattgaaaa 

ctgttatttc 

tttttttaag 

tcgtatttta 

gcgtggttgt 

actttacacc 

cataatacat 

tagacgaact 

tcatttactt 

cccaaatata 

aagcaaagtt 

aataagttac 

ttatatatat 

tattgcctaa 

ttaataatct 

gccggtgtta 

ctatgtaaat 

atataagtta 

taacatttca 

tctatgaaac 

ataaaattta 

aatagaaaaa 

taataaaatt 

ttacgatata 

atcaaactac 

tctaaagact 

gtagtaatgt 

aagaaaacag 

atgagccggg 

gtattaaacc 

ttttattttg 

tagaactctt 

ttgatttgtg 

tctatcattg 

cggtgaccat 

acccattttt 

gttttccaat 

caaacaacca 

ttataagcta 

ttgccttaat 

ttatttaagt 

tatatatttg 

taatataatt 



acatttagtt 
gattctcggt 
tatttgaagg 
gtttttttgt 
acaaaaatga 
gactcttata 
tgtatacatt 
tggtgaacaa 
cttttctttt 
tatcttttta 
taaaatacac 
tagtagttat 
cctaatgagt 
tttttgtatg 
atttatatag 
tggtgacaac 
ttttgcatag 
gtatatcaat 
tgttattatt 
actagtgaca 
tctaagtcaa 
taaaaatggt 
ccacttatct 
aaagtttctc 
acatatgata 
attttagata 
tatttttgta 
caagcaagac 
gaatgcagaa 
atgttagaca 
atatatatat 
ttaatgagag 
ataaattaat 
aatataacac 
atatagtctc 
aatattatat 
atatattttg , 
ataaaattat 
aataatgtac 
tagaaaaaat 
tcaaagtccc 
atatgtcaag 
attgcttata 
gtgacatggt 
acacgatacc 
tagttatatt 
taaatgaaga 
aactcaactg 
ttccgtcagt 
tcaatatact 
gctggtgctt 
ctcaaagtgt 
tttagctgct 
tcgtaccatc 
ttttctatat 
taatacgatt 
ttacatgtta 
cccgcgattc 
ttgtaaactt 
agatccatat 
ataagtacca 



atgttatatg 
tcggttccgg 
tcctcggttc 
ccaagcctag 
atatagttta 
cttcccatct 
tatatatcaa 
aataatataa 
ttcatatttt 
gaaattatta 
tggaatatat 
attccaaatg 
tatgtgcgtg 
atcaaatatt 
gtttcaattt 
gactgttttg 
aacaaagcat 
aatgtttttt 
tttctttact 
aattcaagtc 
ttaagaatta 
gtataccatt 
taagctacta 
tccgccgtcc 
tattacaatc 
gtgtgccgac 
atatcaaaac 
aaatgcactt 
tatgaagttt 
caatataagc 
atatatatat 
tttctcattc 
aatctctata 
aaatcgataa 
cttaaaatca 
tgttggtttt 
tataacattta 
acataaaata 
atacggtaaa 
ctaaataaag 
aacattatta 
attaccaatc 
aaaacaccaa 
tgagttttag 
cacgttaatt 
acttatgtac 
aacgaggctc 
taaatggcgg 
gaccaagata 
agaacttaaa 
gttctgttta 
tacaaaaaag 
gccttctttg 
agtacaagtc 
tttaaattat 
tgaatagata 
aacccgttct 
aattatgtct 
ttattaaaaa 
attgtactgt 
atttgatttt 



tatatataat 
ttcggttatt 
ggttcgggtc 
ttagtccttc 
attttcttca 
atgtcatatt 
caagttaatg 
tctatagagc 
gctatcgcgt 
tatcaaacta 
tactaacaca 
ccaatgcatg 
aatgaataaa 
atattatatg 
tttaactatg 
gacttttaat 
tggtcttttc 
atgaaaaata 
atcagtcagt 
ttttgcaaac 
taacttatgt 
aatatgattg 
aaaaaagtta 
aatagaatat 
gtttgtcaat 
aaaggtatca 
acgaaaaaat 
tttagaataa 
accaagcaat 
tctaaattct 
atatatatat 
tataaactaa 
aattaataaa 
aataataaga 
taaattaata 
atattcacaa 
gtaaaattat 
agataataaa 
ataaaatatt 
aaacttttgt 
tttatagagg 
ttgacatatt 
cgtattaacc 
ttttattaat 
aagtcatttc 
atttagcatg 
aagtggaaaa 
tggtcgtttc 
ctggacctgg 
aagaaataca 
gtttgaacat 
ttttggatgc 
accatgagag 
tggttcgact 
ttccgagttt 
atcagagatt 
ctatctaccc 
ttacctttct 
ttaagattga 
tttataatca 
aataaaattt 



taatattttt 
tcggatatag 
cggatatttc 
ttgaacattt 
cacactaatc 
acgtgttgac 
tgtatatttc 
agccgatccc 
ctatccgtct 
cgcttattct 
atagtacaaa 
ttcacgaaga 
tcattatgag 
aaaaatagac 
acttgttaat 
tcaccattca 
aaatgcgtgt 
tatttaaata 
ttcttatttg 
gtgtaagttt 
attatgttag 
gttgactagt 
ttaataatca 
tactatttat 
tgtcatacac 
cgacgtatta 
cgaccaatat 
gaaactcatg 
taattaagat 
taaggaaact 
atatatatat 
tacagtagaa 
ttttagcggt 
taataatatt 
atttatctat 
tgaaaattct 
accaaaaaca 
acaatattga 
ttctttttta 
aaattaatat 
ttttactgta 
atatagtaat 
atatcttgta 
attgtgaatg 
ttggtatttt 
tgtacctcac 
gggaccacaa 
aaattaaagt 
ttaacataag 
aactagactt 
ctacagctaa 
agttatatat 

ggggatattg 

tcctcgatta 
tagttcattc 
cgagctaata 
aaatccacca 
tgcatttgta 
aatcaaacct 
catatataat 
aaattagtaa 



atatatttgg 

aaatatagac 

ggtgcggttc 

accaattact 

ttgttaacta 

acatgcacat 

aatagtatat 

aatactgcat 

atggttggtt 

tcagatgata 

aaatgaccac 

atctggcaac 

atctacaatt 

accaaatatt 

tgttattgtc 

cgaacttttg 

attttactgc 

aaactataca 

cttatttatt 

agcttaagtc 

tacaatctaa 

aacggtatac 

aattcatcta 

gccaagttgt 

gtaaattcag 

catcttctca 

ttccaacttc 

cagtcatgtg 

gttatcatac 

ggagcactgt 

atatatacag 

cctctataaa 

tccaagttgg 

tttaaaattt 

atataatttt 

ttacttttct 

acttaacaat 

agtcaagttt 

tataaataaa 

ctctataaat 

taatgttata 

atgttaatta 

acacgtatta 

gcagtcgcat 

atgatacatt 

gtgttatgaa 

agggtgttat 

ggccatgtga 

aagaatcaaa 

cattgcaatg 

tagaaaggtg 

atgtagctac 

aagggaaccg 

gatttaggcc 

tcacacaagt 

agcgagtttg 

tcttttcttc 

actaaatctt 

ttatttttgt 

tcttcaataa 

taaagtttca 
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acaaccaaat 

tgtcaaaccc 

atttgtaatc 

atgaaatact 

gttatagtaa 

aggtgctctg 

cgtctgtgca 

tgtcgtttta 

ctacttatat 

atgcaaaccc 

ggtaaattgt 

gttaagtggc 

agtagtttgt 

tagttctttt 

tctttctaac 

aaagttgtag 

gagtttgata 

accaaagtat 

ggtctatccc 

tgatacttgg 

aacaaagtac 

ccgatttatt 

gtaaccaaaa 

gcacaaagat 

atcattcaat 

accttaaaac 

taatttgaaa 

aaaaacttat 

atatttttaa 

agtatataac 

aaagcgaaac 

ttaagtggat 

atatatttag 

gaaatttggt 

cgaatacttg 

gtatttgtgc 

cgtagtgaca 

gaatcgaccg 

agatgacaag 

tttggtgggc 

accgtattgt 

tcacactccc 

cctctctcta 

ctcccgaaat 

cccgaacccg 

caagtttcag 

cgagaagaac 

gggacgcgat 

aatgaaacgt 

gtcgtctaag 

agaagaagct 

cgaagaagac 

tcaattcgag 

gttcagactg 

cggcgactta 

agatctcttc 

aggagcggtg 

agcgctcgtc 

taggcatccg 

caccacgtat 

cagtgtttct 



tttctaaatc 

aaattaaaac 

ttactctatt 

atttattttt 

aattctaaca 

gtacgaaggt 

gcaacatggt 

tcctacccaa 

ttttgagatc 

aatcgtggac 

gaacaatata 

tgaagtgaag 

tgccgtatct 

gttcaggttt 

tttgccgatt 

tagtccagtg 

cttgttggaa 

cactgtttag 

taacaataag 

cttggttcgt 

ttactatatc 

gttaaaatca 

aaaaatctta 

taagaaattt 

caattataaa 

ctcaaaatat 

cggagggagt 

tttatcaaat 

tagataatta 

aagttatacg 

aagacaaaca 

taaggttttg 

gaacgagtaa 

ttgtatttgc 

attttacgga 

ctagttatac 

gcttcgcact 

gatagtgaaa 

atacaggaat 

ttggtcatta 

cgtttggcaa 

tctcctgtct 

ggtcgaacca 

ccccgtaata 

gcggatgacg 

cagatacagc 

cgaaacacgt 

gacgtggagt 

agcgagttcg 

aagatcgacg 

gtggatttag 

tgggtcgacg 

ttcgatttgg 

agcttagccg 

gacgtcgaga 

gtgtgctgcg 

gcggttgtgg 

atcgtcgaag 

tcgaaggaca 

ttgattaaaa 

tgttacgtcc 



gataattcta 
aatagtcaaa 
ttttattatt 
tactcattac 
cctttataat 
cttttcagag 
ctgttctgcg 
tggtttcatt 
cttacatcgg 
tagagcagtg 
ttaagaaact 
tctttcaact 
ttctggtctc 
ttgtctcttg 
ttctattgta 
ataatttttt 
actaagtatc 
ccagtgagtg 
gaatgggcac 
tcaaataatc 
aagtacttaa 
agtacttgtt 
tactccctct 
acattcttgt 
atttttattg 
tatatatttt 
aaaaaataac 
ctaatatgtt 
aaaattctag 
gcttattatc 
agccaagtaa 
aaactgtaaa 
caaatccaaa 
aagtttttta 
gaaagtacaa 
ttaacatcag 
aaatccattt 
atgaaaaaaa 
caaatcaacc 
ttacgccaaa 
caaaaaaaaa 
ctccaatggc 
cctcttcctt 
atcttcagtt 
acccgcccga 
gccaagccgc 
acctctcggc 
ctggaggcga 
ttaaaaaagg 
aggaagaaga 
acgagatcga 
aggaagggaa 
atgatttcgg 
agctcttgga 
tcaccggaat 
acggaggaga 
ctagcaaaga 
acaccgaagc 
tggcggttat 
gcctctatga 
atggagataa 



cagccaaaac 
gtcttgtctt 
ttataatata 
tctagttttt 
ttgtaaaacc 
tgttggatag 
tccttcggga 
tggtcggtgt 
gtgtaatatt 
gattagtcgg 
ttatgatccc 
gacatcattt 
gttgtttaga 
aatgttcttt 
actcatgttt 
tggagctcag 
actgcttagc 
tttgggtttc 
aaaccaaata 
atattcacga 
gagaaaaata 
ttaaaactat 
gtttttcaat 
aaaaagaatt 
gttgaacagt 
gaaacaacat 
tagacgaatt 
tatctgattt 
aaacaagtag 
cgatttaaca 
aaagaatcat 
aacttatgcc 
caagtcgatt 
tttgaagtat 
gacgagttgc 
ttggaaagta 
tttacaataa 
aaaagaaaag 
tattatctgt 
cccacctcgt 
ctcttatccc 
cttcactttc 
ctcctcctac 
agcggcgggc 
agccccggag 
tcgagcgcgg 
catcgctgac 
tctgttctcg 
actgctccaa 
agaagacgac 
taaactgact 
ccctaggatc 
cgaatccaag 
cgagagcaaa 
ccagcacgat 
cgactccgtc 
gatcgatatt 
agtcttggct 
cggagtcacg 
agctatgggt 
caagatggat 



ataaagtgaa 
aaacaatcag 
gtaaaaatta 
ttactttaaa 
aaaataatat 
tttttctgct 
tatgcctctc 
ctatgtttgg 
cgagtaaggt 
atcagtgttc 
taatctcttg 
gttgctaatt 
tagtccgtag 
ccgccaactt 
tatttcttca 
cgtgtatcca 
cagccagtgt 
gaccaaaatg 
cttgctgatt 
tttggaattt 
gtacttgcaa 
ttattacttc 
atatgatgtt 
ttaaaatata 
tttcaataaa 
aacacttcta 
ttgctatctt 
ttgttacaat 
cgaataatat 
atgtcaaata 
ggctcacata 
aatgttgatt 
aacttttttt 
ttgttacttg 
gagtaaaagg 
ctccaaacct 
gtacattccg 
aaatctccaa 
taccatttta 
atcactaaca 
cttcgaaaaa 
ttctctcctc 
aaaccggcat 
cccacccgtc 
gattcgatgc 
aagctggagg 
gtggaagacg 
gacatcgata 
cctaaccctc 
gctgttgatg 
ggattaaccg 
atcagcaaga 
gcgagaatcg 
gtggtgccga 
tcgcgaggcg 
ctgagcgaag 
gaagatacgt 
gcgttagctt 
gggactaacg 
gtgagaacag 
tcaacgacga 



aaatcttacc 
agcctaaatc 
gattactata 
acaaaataaa 
tttacacaag 
tgtgctcaag 
tacttgggca 
ttctagtgct 
tagttcatgc 
tcgtggcttc 
ttttagtaat 
catcctagtt 
taatcgcgta 
tctggctttg 
ataaacaacc 
tatttgttct 
ttgggtttcg 
atttacaatt 
ttcaaatgct 
acttgtttaa 
gttacttgat 
aaaactgtaa 
tttattgaat 
tgattttaaa 
ataaaatttt 
aaacatcata 
tttttaatca 
aaaagacaat 
gtaaacaata 
cttgattttt 
gtttttaaat 
aaacaaaaag 
ttaaagactt 
accatcactg 
cgaatttcca 
aactaaacag 
actagaatag 
taaattccta 

agtgggctat 

ctcgaaacgc 
cctttcttca 
accccgtctt 
actcaccatt 
gtagctctta 
acggcgtctc 
aagaagactt 
cgcccgagac 
gagccatctc 
ccaaaacggc 
agcttgacga 
aagcttccga 
agaaggagca 
tggagcctaa 
tctcagtcta 
taagcgccgg 
ctgacaagag 
taggctgtag 
cttcgtttta 
ggaagacgac 
gaatgttcag 
cgagtcctga 
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tgctgttttg 
tatggaagct 
tgcggtcttc 
tagagatgct 
ggttaacatt 
tgttgtgacg 
gtctttgttt 
tggtttgtta 
tgtcggagct 
gaggtgtgag 
acctgatggt 
tactggtata 
atgtattgtc 
acgaccggtt 
taatccgggg 
tgatgtgtaa 
gttggctggg 
tcctccattg 
ttgtgctgtt 
ctctgtacac 
taaagtaatc 
ttctttgcag 
agagttctat 
tcaagctgga 
tttgccaaat 
cttcctctga 
agagtcatta 
cgccaagata 
caaccctttg 
agatgtgtac 
tggcaggtgt 
ataaaacaaa 
ggcaactcca 
ggatggtggt 
caagagggaa 
aattcagtgt 
gagtgcagct 
ttcttctttg 
aaaatctatg 
ataatcataa 
aaaagaaaag 
acaccaaacc 
tccaaaatta 
acactcatct 
aaaccagtca 
tcctggaaaa 
gtgatgtaaa 
agcccatcat 
acgttgggag 
cggctctgct 
cctctgtgtg 
aagccgcaga 
ctagcatcaa 
agtttcccat 
acaccactag 
cagaacccgt 
atctcatgga 
agagtagtga 
tcctcagcag 
attgtattag 
aacttgcctt 



gttcagagta 
tctcctcaag 
acgaatttag 
gaagccaagt 
gacgatccaa 
tttgcaatgg 
gagacacagg 
ggacggcata 
cctcttgagg 
ttgattgatg 
ttgtcaaggc 
tacatcttac 
gtctgatttg 
atgacgaaaa 
aatgaagatc 
caggtagaaa 
attggatgga 
gcaaatggtc 
gcgatgggtg 
cttttttact 
tcagcttcta 
gtggtagcag 
gatgatcgag 
atagacacaa 
gattaaacag 
tcacactaaa 
atatctctct 
gatttgcaga 
tgagtcatca 
aaatttttgt 
tttttttaat 
atcaacacaa 
tactaattag 
tgttgtttca 
gatggagcta 
taaccgaaat 
ttcttttttt 
ctttttctgt 
aaactgttgt 
ccgatggaat 
tgtctttcat 
attcatttaa 
taaaggtaag 
gcagatcctc 
gcatattgcg 
tgtctagagc 
taattgcatc 
caacccttcc 
agaagctctt 
tgcatagtcc 
gcatctcctc 
tcaatatatt 
gatccatctt 
tatcgcagag 
aaatcatctg 
gaataagagt 
gaagttccat 
aagtgaatac 
catcaagacg 
gaatgatacc 
ccttgacaaa 



tgatggcgaa 
aactcgcttc 
ccagagagga 
tgtttgcaag 
acgcagcgtt 
agaacacgaa 
ttttgctcaa 
acatttataa 
atattgttag 
aggaacaagc 
tgcttgattc 
gtctatttaa 
ttaaagttat 
tcgcaactga 
catgtgagta 
actctttaaa 
cgatgcaaga 
atagactctt 
aagaaggtga 
ttccctggtg 
agttacatta 
ggaaaggcca 
aggaatgtcg 
gcgagttccc 
agattctgat 
tgttctttcc 
atggtgagat 
gage t cat ga 
ggtaagtgta 
tagtggttgt 
aataagcett 
atgagtagga 
caagacctaa 
aggtagagga 
cgaaaagaag 
tggatgtgga 
taaccacgag 
ttttaaaaat 
taattgtagc 
tcacctacaa 
tctttcattc 
gaaagtttcc 
caaagaaatt 
aagcattgee 
gatagtaatg 
cccattaata 
agcaactatc 
tgccttacag 
gctacccatc 
atcgatcatt 
gtataattcc 
gtaagttaga 
actcttctgc 
accgtccagc 
ctgtgaaagg 
gttgtaagta 
tccatcatct 
gtceggagag 
atcctgtttg 
ccttggaagc 
tgeattgate 



gatgttgcat 
agggaaatgc 
tagtggcttt 
aatggtcgac 
tttcgtccag 
agcggatgtt 
tacacctcag 
cattcttget 
aggtgttgag 
ttttggtgtt 
ggttcgagag 
tccatatcca 
tggctgtgcg 
aaagagtgat 
aatctgagat 
tgtgatttca 
gtatctgaaa 
ccttcacgac 
catggttgta 
tgttattaag 
tgcaatggtg 
egaagegtat 
ggaagcatta 
atggaggtaa 
ctgtgagagt 
tegtattget 
cattaaacca 
attctgtacg 
ggggagagaa 
tgcgtttgtc 
cattggctcg 
cataccacac 
ccaaaagaaa 
ctaaggagtt 
caaggtcata 
atcattgtta 
ctggagagag 
ttgatgaaat 
aataagaatc 
aataaaatca 
atcctccaac 
ctgatgetae 
tgaaaacaaa 
actgcccttt 
gtatcaggat 
ttacccactt 
cctcttcgac 
tagecattaa 
gaaacaaaca 
gagctatagg 
tcggcctcta 
acatcaggtt 
atagecttaa 
aaagtgttac 
tctagagcag 
actgtgttag 
atcctcttag 
cagcccttgg 
caaaacccat 
atetcategt 
aaagcattat 



aatggaaccg 
gacgaagttg 
cgcggtactg 
ceggaaagae 
caagggaacc 
cacccgttga 
ggtatactgg 
gctgttgctg 
gaagtcgatg 
attgtggatc 
ettaagecaa 
ctagttctta 
ggtgagactg 
gtgacgatgt 
tagctcgagt 
gtggacatat 
caeggagaac 
attagaegtg 
agtatttaaa 
gttgctggat 
tttcttgaca 
cagcttgaag 
caatacgttg 
gtagaagaga 
tactgaaaca 
ctgcttttgc 
tgaatcagat 
agagacttgg 
gagattctaa 
tgtgttgagg 
aaattaaatt 
actgeaatga 
accctatggc 
gcagaaggag 
gecgaaegga 
aceggagtte 
tgeagcttte 
cacagaacaa 
aatagagaag 
acaaggcett 
tgataccctg 
agcataagct 
atgtgttgaa 
ctagttcctc 
acacaccact 
tacgaaaacc 
ccatctcgca 
tgagtgtgtt 
tttgtgtagc 
tgatagtatc 
aaaacttccc 
ccacaccatt 
acatttccaa 
aagtaacgat 
cattaagatc 
caactaatcc 
ccccacaata 
tagecatcaa 
cgatcattga 
ataattctgc 
aagttacaac 



aagctctggt 

atttcgacat 

atgaggagta 

acaggaaagt 

ctgatgttcc 

agtttgagct 

aaatttcgtc 

tegggatage 

ctgtcccggg 

atgetaacac 

gaagaatcat 

tggtgtttat 

agagagggaa 

tgacatctga 

ctataaccct 

tggatgacat 

atgattacta 

tagctgtgcg 

taaacccatt 

tgaaattctt 

ctgatttact 

gtgataagaa 

atgagcttca 

aacagcctct 

tgtcaagatg 

aggttaccag 

ttacttccac 

aggtgtttaa 

aggtggagga 

gtttaagata 

ggatgagaac 

gaaggaataa 

gaatatggaa 

gttctgattg 

gttggatgtg 

gagctggaga 

tetgetegtt 

gtgataaaaa 

cgtagctaca 

gtaaatttgg 

caaaacaaac 

gcagctcttc 

tgacataccc 

tttactccaa 

tgaaatcatc 

ataaatcaaa 



gaaaagctcc 
aaatgtcact 
ctcatctagg 
tgggactata 
ttcattgatc 
gaaggggtga 
tgcatctttt 
atcagggcac 
gcccaccaga 
tcttctaggc 
teegtctatg 
ataaaacatg 
attatatgtg 
agectcgaag 
attagggctg 
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160921 
160981 
161041 
161101 
161161 
161221 
161281 
161341 
161401 
161461 
161521 
161581 
161641 
161701 
161761 
161821 
161881 
161941 
162001 
162061 
162121 
162181 
162241 
162301 
162361 
162421 
162481 
162541 
162601 
162661 
162721 
162781 
162841 
162901 
162961 
163021 
163081 
163141 
163201 
163261 
163321 
163381 
163441 
163501 
163561 
163621 
163681 
163741 
163801 
163861 
163921 
163981 
164041 
164101 
164161 
164221 
164281 
164341 
164401 
164461 
164521 



atcttccttt 
caaaatccac 
atttcaatga 
atggcactat 
agattcaatg 
gtaatctggt 
acaactcgac 
ctacaaattt 
tggagcaggg 
aatgtagaca 
aagctgtata 
agatccgggc 
ggtaaaggac 
atttcataag 
tctccatccc 
gccgcagaca 
cctgagatta 
tggcttgagc 
ttcggttggt 
ccaaaacggt 
acctgaacaa 
cctgaactca 
cattgaaata 
ttttaaaact 
attatatctt 
aattatctta 
tttttatttt 
tttgttcaaa 
aaatattaaa 
ataatctata 
gtaattataa 
taagtaaaaa 
atttaatacg 
gtagatttat 
gtcgtatttg 
ttggaataaa 
actcaactct 
aattattcat 
taaattttaa 
ttgttttatg 
gactatatta 
tattttaaac 
taattagtca 
gttttatatt 
gatccaagat 
ctaaaacttg 
aattatgttt 
tttctatcga 
taatgtattg 
ttataaatta 
tttttttttc 
aacgagcccg 
accaaaaccc 
ggagaatcaa 
cacaccacca 
ccggagctca 
taatcggaga 
accctagaga 
cgacgagacc 
aaaacctaaa 
tcacaccccg 



ctaacatttc 
cgatcataca 
aaagattatg 
agataaccac 
cagacacagt 
caggctggag 
cctcgcggca 
gatgaaacaa 
tggtgaaggt 
aagcaaaggg 
tatcacatcg 
gttccattct 
gagatcgaag 
atccacttcg 
tgcttttctc 
cagaagacga 
ccaagaagcc 
gactataacc 
taaataatcg 
ttaatcaaag 
tttggatcgt 
attaaaactt 
ttattaaaaa 
ttggtgataa 
caataaactt 
gatatactta 
ttaaacttaa 
atttttgaag 
atatataata 
caagaaattt 
atattttatt 
cat gt tat at 
atgaaaaaat 
gcacatttta 
gcactaacca 
cacttttccc 
tttaggtaac 
cagggaatta 
cccgcccttt 
tttttgtaat 
aatatgtcaa 
tttatttcta 
aaaatattcg 
cagtgttttg 
aaatcagatt 
gaacccgatt 
ttaaaatctg 
ccatttatta 
ttggtatatt 
tatatgcata 
cgtcaagatt 
ctaacttcgg 
tagcgctacg 
gatacgtgtt 
ccgcttcacg 
caaactccac 
gagcatcaat 
acgcttcaca 
acggtcaacc 
ccctaataaa 
gagactagaa 



ttgcaacaac 
gttgtaggta 
agaatcgcta 
attgggtttg 
gtcgcccatc 
accattttcc 
aagaccattc 
atccaaggct 
aacaacatca 
gagcttagag 
aatctgtttc 
caccaccgca 
catgtcactg 
cagtttcaaa 
ggccagagca 
agaggaagac 
aacgacgagt 
aaatcactga 
gttcaaattg 
ttggttcact 
gccaaataca 
taaaaaccta 
gttgaaatta 
ttttttctga 
gt at at tact 
aaattttaaa 
gtgaattttt 
ttatttatat 
ataacttttg 
ttataaataa 
atttatattt 
atttatataa 
agtagtttat 
aaaagttcgg 
ttagttcggg 
gggatgaacc 
taaccatttc 
cacaagataa 
gatggcgggt 
tatatttgta 
gtacgtaact 
aagtttacaa 
tataaaaaaa 
aaacccgacc 
taggttttat 
agtgactaat 
ttttgtataa 
ataatatttt 
ttgaggtttt 
ttctttcatt 
ttatattaaa 
gcccaaatta 
gaatgctcgg 
cacatcctcg 
ctaccaccgg 
acatcctcat 
cgtcgctacg 
tgcatcacca 
aaacccaaac 
ctaggagccg 
ccgatgacgg 



cgctgggctg 
actatatttg 
tggcgtccat 
atgtggctta 
ttacacatcc 
accatccgat 
atcagcgtgg 
tcagaaaccc 
gggtggagtc 

caactgcaga 
ctttccatct 
cccattagct 
aacaaatcaa 
ctctctcctc 
tgacgaatcg 
tcgaatctgc 
gagtgaggag 
tttgaaatca 
attttacact 
tgggatgaac 
acccgaacta 
cccgaacttt 
cataaattat 
tttatataaa 
aaaatataaa 
gttatttata 
tttcaaattt 
tttgaacctt 
aaaaaaatca 
cttcaaaatt 
tagtaagata 
attagaaaaa 
gtaatttcaa 
gtaggttttt 
ttgtatttgg 
ttggccagaa 
tagaatcaga 
aaatggataa 
aatttttgtt 
attattaatt 
atatacttgt 
catattatat 
atctgacttg 
cagatccaag 
aaaaaataat 
agataacttc 
cttttagtaa 
aattttatta 
tctattttgt 
tggttgatgt 
taaaggccag 
caatccgaaa 
gatgtcagcc 
tcctcgagag 
ggaaccacac 
cgcgaacact 
agatctcatc 
accttcccgc 
gaaccacaaa 
aaagccggcg 
tggaaccgaa 



cactccatct 
gaaagattcc 
ctttacaaag 
tctcctccat 
catctacaaa 
caagcagagc 
tgaacgttag 
tgtgatcaag 
caagcttggt 
aacattttat 
tttgatagag 
tgttgaaatc 
tcgcatcctc 
taaaacctgc 
atcccgtaca 
aaaccctagc 
tggtcaagtc 
gaaaacaatg 
gaccggttca 
cgggaaaagt 
atggttagtg 
ttaaaacgtg 
tattttttca 
tacataacat 
taataaaata 
aataaaaaat 
attattagat 
ttacctttca 
cttaagtttt 
ttaagtatat 
taagtttatt 
attatcacca 
ctttttaata 
aaagtgttaa 
cacgatccaa 
tcaacctcgt 
ataaaattgg 
ttagttcttt 
ttaatttttt 
tagtttgata 
actatatgca 
tttttagtag 
gaatcgatcc 
attgaaccgg 
attttaaaag 
tactttttat 
ataaattagg 
gaaatagaat 
gggtgcattg 
aatcgtagat 
agcccaaaag 
cgggcctaca 
gccgcgtaga 
acacgcgtca 
ctcagttcac 
ttttcgatct 
ccccgtagtt 
cggagacctt 
gagaagaaaa 
gcgcaagaca 
gaggcttccc 



accagagatg 
cttgtcttgc 
gccatcaatg 
cttcctcaga 
tgttccgtaa 
tacggcttcg 
gacatctggt 
acataatccg 
gagcttacca 
cagaatggtg 
agaaatcaca 
aatcacagaa 
taaccctttg 
ttcgccactc 
gaaaaatcta 
caacatttcg 
gtgaactttt 
accgtgtcaa 
actcctgttg 
gcctattcca 
ccaaatacga 
cataaatcta 
tcgtattaaa 
gtttttactt 
tttataatta 
gtatagatta 
attttaatat 
aaattgaaca 
aaaaataaaa 
ttgagatcat 
gaagatataa 
aagttttaaa 
atatttcaat 
ttgagttcag 
attgttcagg 
acccggtctt 
ataaaaacct 
actttgtaac 
tttgagattt 
taattttggt 
tttcaaagat 
ctactaaata 
gaaaattaaa 
taaatccggt 
tcaaaaatca 
tttagttttt 
ttttttaatt 
atattttgta 
gttttacgat 
ggagtagatt 
cttacaacca 
gcccaaacaa 
tctaaccttg 
cgcaggccag 
cggaacccaa 
aggatggctc 
tatcgcacac 
caccctcctc 
gaaaaaaaaa 
gtggagatca 
cgtcgcggat 
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164581 acgaaggccg gcgtcgatgg agctaaggta 
164641 aaagatgtct tcaccgcgcc atcaacccac 
164701 ccgctgaggt ggctggccag agttcggaca 
164761 aacatcagac taaattttag cttaaagaaa 
164821 gcaccgaccg accgccgggt ccaaaccacg 
164881 tctctccaaa ccacgattca cttttttgtg 
164941 tgtattttta atgtaggttc tgactaatat 
165001 aaataaatat aattatagga agcacaaaaa 
165061 ctataaaagg tattctgatt taattatatt 
165121 tttacaagca acaatacgct tttagttcgt 
165181 tgagaacaat cagcaacttt tgatgatggc 
165241 ccataaatat gtctcatatg aacttgaagt 
165301 agtgcctccc caataatcac agtccacaga 
165361 attgttggac atcagggcat aagcttcctt 
165421 gatttgaaat agccaaatta aatagagaac 
165481 cttaagatca ctactgatga tttagacaaa 
165541 ccgaacagag cttcagtaac gccatatgca 
165601 gtcattttcc tttttcagat ataactaaaa 
165661 ctctagtagc cgtgtgaaaa tactagtaag 
165721 accactctaa caagttatat cgctgttctg 
165781 aaaaaaccga gacgttctta ctcaaataat 
165841 tcttaactct ccctgctacc atgaaaccgc 
165901 gacgacctca aaatcacagc cacgcaaaaa 
165961 tgattctagt aacgagaaat gaaggacagc 
166021 atattaatct caatgcactt tggattgtca 
166081 cgttgatcaa caagagcagt ttttctgttt 
166141 cagctctaat acaataatcc caatacgtta 
166201 tcataaaata cacaattgta tatatcatcc 
166261 aatactaaaa caggataagc aagaccacct 
166321 agaatctaga atcgcaggga aataacacag 
166381 ataagcatca tatacatata tattcaaatg 
166441 cattgaagta ttgttctcac ttactttaac 
166501 ttgactcttt ctttgatcca accactggga 
166561 tgaaaacaat tagaatataa catttgcatc 
166621 cttcttcatc ttggacattg tgtgatgttt 
166681 gtttataagt cggagccaaa ccgttttagt 
166741 actctctaac cggttttact atcaagaaag 
166801 ctcgtctctg gtttctttgt tgattaacaa 
166861 agtttattct acgtatggta aattgatttc 
166921 agcaagtcaa gaggcgaacc agagaagcat 
166981 aagaacatga tctcctcctg aacccacatg 
167041 gaagaagagg aggaagatgg catcacagct 
167101 taaccatggt gacagctaca gaacgggcag 
167161 ttctggaagg tgtgcctctt tagttcagac 
167221 atttaattgg aaatggtcaa atttattaag 
167281 ttagaatagc agcaaaataa cttactaacc 
167341 aacgatttga tcaaacatta gtagggtata 
167401 agctctgctg cgcaaacaga agaaaatctt 
167461 acaatgcaaa accaaaccaa tggttaattc 
167521 cacaaaagaa tgtatacatt cttgtttata 
167581 cataccatac tcatctgcag tttctcaagc 
167641 ctccataaac cagtcagcat attgcggatg 
167701 .atcatctcct ggaaaatgtc tagagcccca 
167761 atcaaagtga tgtaagtaat tgcgttagca 
167821 agctccagcc catcatcaac ccttcctgcc 
167881 gtcactacgt ttggagagaa gctcttgcta 
167941 tctaggcggc tctgcttgca taatccatcg 
168001 actatacccc tgtggggcat ctcctcgtat 
168061 ttgatcaagc cgctgatcaa tatattgtaa 
168121 gggtgactag catcaagatc cttcttactc 
168181 tcttttagtt tcccattatc gcagagacca 



gcctccatct cccagaagta gacccaaaca 
caccgcgtcg ttgctccaag aacagaacca 
aagcagatgc cggaggagat agagaggaaa 
acagacaccg acggcggcac ggacgctaac 
attcactttt tctctctttt ctttctcttc 
ccttgttcta gatggagtag attaagttgt 
tagttgaatt agttagtata taattagggg 
gtaatatgcc aaggaaaatc aaaggtaaat 
gatagtataa gaatcgatac taaataccgg 
ttctttggca agaacagaat caaagcgttg 
tatactgcag cttttccaac acttttggtt 
taagtgcttt ctcaagaaac acgcttcttc 
tcagatcagg gtctgcaacc catgtttaat 
gtcacaacct gaaaacacac tcgaatcaat 
gttaaggatt ataaccaatg gagttaagaa 
agaaaggaca gaccctggat ttcaaataaa 
accaatagag aaacaacaca ccaattttct 
ctagacacat agaaagtttt aagatgtcca 
cttaaggtat aataagtatg agagcttttc 
attgtgataa agttattgct acccccaaca 
ggctgtatta tcttcttttt gcaatttcac 
atggatctgc attgtcaatc taaacaccat 
caaagacggg ttcacaaatt acaaaccaac 
acaatggagt ttcaaatgct tgcagcaatc 
catgttgggc atctcagtta cccatctaca 
gctaagaaat caacgagagc agtttagtcc 
acgtgttcat tatccactaa aaactgaaat 
gggatgattc atccccaaaa ctccgactct 
aatctaaata ttaagaagga aaaacaaatc 
accctatttg cctattctca tagcttcatc 
cgatgattac acaggttgtc acaagaaaag 
tcagttttcg tcaccttaca caatgcaatg 
gtaatctctt aaaaaagtta gaccaagcac 
tcctcaccca gagagatgac aacacaaata 
gagagagaga tgcaaaacgg tggggttccg 
atcgattttt agttcatatg atcaatctct 
aaaaaagaga agcttataga gtttgtattt 
aagcttaagc aataatagaa tgaatctgca 
atagcaattt ggagataaag aatgaagcta 
aagcgacaac aacatttcca ccaagattaa 
ggagcagaca aggcaggaca gagccaatat 
atcagctctc atgaagaccg agcttgacga 
acacctaaac atcttttaac acctgaaatc 
caataatctt gcgtgaaaca agtgtatgga 
cccagacaat agacgcataa aaggctgtca 
acacaatttt tcactatgat cactctattc 
catgcagcag cagaaacaaa atgacattga 
tgatcattca tcccccaaat gatagatcct 
cagaacgtgt atcatcaatc tgaagctaca 
taaaaaatac atagactgaa cagaaactta 
attgccactg ccctttttag ttcctcttta 
gtaatggtat caggatacac accacttgaa 
ttaatattac ccactttacg aaaaccacaa 
actatccctc ttcgacccat ctcgcagaaa 
ttacagtagc cattaatgag tgtagtaaag 
cccatcgaat caaacatttg tgtagcctca 
at cat t gage tataggtgat agtatctggg 
aattcctegg cctctaaaaa cttcccttca 
gtttgaacat caggttccac accattgaag 
ttctgcataa ccttaaacat ttccaatgea 
tccagcaaag tgtcacaagt aacgatatca 
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168241 gggcacaaac cactagagat catctcttgt aaaaggtcta gagcagcatt aagatcgccc 
168301 accagataga acccgtgaat aagagtgttg taagtagttg tgtcagcaac taatcctgtt 
168361 tcagtcatct catggagaag ttccattcca tcatctatcc tcttagcccc acaatatccg 
168421 tctatgagag tattgaaagt gattaggttg ggagagcagc ccttggtagc catcaaataa 
168481 aacatgtgct cagcagcatc aagacgattc tgtttgcaaa atccatcgat cattgaacta 
168541 tatgtgattg tattagggat tatacccctt ggaagcatct catcgtataa ttcttcagcc 
168601 tcaaagaact tgccttcctt gacaaatgca ttgatcaaag cattataagt tacaacatca 
168661 gggctgatct tcctttctaa catttcttgc aacaactgct ccgcgtcgct ccatctacca 
168721 gagctacaaa aaccaactat catactgttg taggtaaata aatcgggaaa gattcctttc 
168781 tcttgcattt cagtgaaaag attttgtgca tcgctatgac gtccgtcttt acaaaggcta 
168841 tcaatgattg cactatagat tacaacattg ggtatgatgt ggctcacctc ctccatcttc 
168901 ctcagcagat tcagtgcaga cacagtatct cccttcttac acatcccatc tacgattgtt 
168961 ccataagtaa tctgggtagg ctggagacca tcttccatca tccgatcaag cagagctacg 
169021 gcttcgacaa ttctaccctc gcggcaaaga ccgttcatca aagtggtgaa ggttacgaca 
169081 ttgggcctac atgtcgtttc aaacatttga tgaaaaaaat ccaaggcttc agaaacccta 
169141 tcttccacac ataatccatg gagcagggtg gtgaaggtaa caacatcagg gtggagtcca 
169201 agcttggtga tcttaccaaa tgtagacaaa gcaaagggga gcttagagca gctgcagaaa 
169261 cattttatca gaatattgaa gctgtatata tcacatcgaa tctgtttcct ttccatcttc 
169321 tgatagagag aaatcacaag atccgggcgt tccattctca ccaccacacc catcaattta 
169381 cagaaatcaa ccacagaagg taaaggacga gatcgaagca tgtcactgaa caaatcaatc 
169441 gcatcctcta aacctttgat ttcatgaaac ccactttgca gcttcaaact ctctcctcca 
169501 aaacctgctt cgcaactctc tccgcttgcc ttggccagag tatcacgaat cgatctcgta 
169561 cagaacaatc tagccgcaga ctcagcagga gaagaagaac acttgaatcc acaaacccta 
169621 gccaacattt tatttttgtt tcgcctaaat tttgttcttc gtcgatctct aaaaaaaact 
169681 cgtgactgag aaaataaacg agagataaag tacaaacggg ttccatttgt tttacaagaa 
169741 ccggttcagt ttgaattata caatttccgg ttcagatttg tcaccagttg attgattgag 
169801 ttccttgttt aaacccacag ggccacatga ttcactaaac ccagagccga gagaaattga 
169861 taaaccgagc tattgaaacc agtagagttc cggttcagta tttattacta agccttcttg 
169921 tttagtccac ttttaaaaat caagacgacc aatcaaagac catttacaga ctatccactt 
169981 ctaaaattat taaacgatca taaaaattta tcaacagcca ggtgtgttaa ataattatat 
170041 tagaggaacc agttaatgac atttttattg aaccagttaa tgacattttt attgatgtta 
170101 taagctaact taaaatataa acaaagttta tttactttgg gagaaccttg gccatcaata 
170161 tcatccccta tttcattcaa ctcttaagta actaaacatt tgactcaaaa ctaaattgga 
170221 tatgtacata aacaatgagt tggacatacc acactgcaat gagaaggaat cgagaatctc 
170281 catactaatt agaaagacct aaccaaaatc taacctatta atttagggtt ctatttttta 
170341 tctactgttc agagtagggc tggacaaaaa aactgaatcc aaagaaccga accgaatccg 
170401 atctgcaaaa gtaataccaa atccgaaccg aaattaattg aatatccgaa taggtttaaa 
170461 tttttggtat ttaaagaacc aaaaccgaac ccgatccgaa ccaaaatatt ttggatatcc 
170521 gaatgtatcc gaaatagatt aatatacata tatatttaac tatttttaga ttgaatatat 
170581 attaaaaagc atctaaaata tatataatac ttttaaattg ttcaaaatac tagaaaatat 
170641 ataaaaacat taaaaagtac atgtataaat agttaaagca tactcaaaac accaaaaata 
170701 tataaaatat gattgatttt ctatccaaat atccaaatca aaccaattta catgttaagt 
170761 tttggtactt cgacacatat tattgaaatt tatatgtaat atattatttt gtttacaaat 
170821 ttcgaaaagt ttaaaatata taatgaaatt ttaaaatttt gaaaataatt taaacgggtt 
170881 atccgaaccc gaaccgaatc cgcaaggatc cgaaccgaac ccgaaccgta atttagaaat 
170941 aaccgaatgg ggctaaaatc tttgacccca aaaatccgaa atccgaatag actcgaaccg 
171001 aaacccgaat aggtacccga acgcccaccc ctagttcaaa gattaggacc acttagttaa 
171061 atatttaata tgacatatat gtttatttat attaaatctt aataatcaaa aaaaaaaaaa 
171121 atcaatttta acaatacatt tgaattttat tttagctata acaaaatttg ttgaaaaaaa 
171181 tctaacaaaa tcctacccaa cgttttaaat atttattata aaataatgta ttaattaatt 
171241 tttgcaatta gtaatataat aatattataa ttattgttat ttagaatttt atcataaaac 
171301 aatgaaaata aaaaattatg ctattttata aatttatcat tataatctat gttgtactca 
171361 aatagaggtg ttctaaacta aatataaaag atgtcaaaaa aaaatgaact aaatataaaa 
171421 taatatatta aattggttga ttaaatctat ttttaaaata cttttatgta aataaattta 
171481 tcactcacgg ttaaaacatt gtaagagcag ctctgctcgt tttctttttt gcttcttctg 
171541 ttctcaaaaa cttgatgaaa tcacagaaac tgagtcaagt atatgatcag gaacacagaa 
171601 caagtgataa acaaaatcta tgaatatgtt gttagtgata ggagataaaa cactacttga 
171661 ttccatggaa aacgtcacca acggcgcaat aagaatcaat agagaagcgt agctattctg 
171721 taatcataac cgatggaatg cacctacaaa ataaaattat ctagggattg taatttggaa 
171781 aagaaaaatg tctttcattc ttttatgggt caattagggg aaagggcaac ttttaagttt 
171841 gaattaggaa aataggccac ctcaaattta tattaggcaa ttgggcaacc tagtagagag 
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171901 agaataaatt tatgacattt ttaacctttt tgcctattaa acttggcaag ttggaaaagt 
171961 aatttcgtgg gacccaaaat ccactttgcc tataaaaaat aatttttttc ccactaactt 
172021 ttaacaacta acttttgtaa atcccataaa cattctttat tcattaaact tttggatcat 
172081 atttaatatt taaaaatcat atatgagcaa ctacttttaa tataatacat attttttcat 
172141 atataaagtc aacataaatc tatacgaatt aatataaaat cgatgtaaat tctaaaattt 
172201 atgaaagtta cccaataaat tatacatttt gactcataaa agaaactttc atatttatga 
172261 aatctaccca aataaatgat aattcataaa attatgtaaa tttatggata attatcataa 
172321 atctgagtaa atttaataaa cttggaaacc ttcatagatc cttcattttt tgccaaaatg 
172381 gcaggaaaac aaatttgtct acatttaatt ttaatatagt tgggtaaatt taataaattt 
172441 ggaaacctac gtgaatctta catttttttt tggtcaaaat tgatctcaca ctttttgttt 
172501 ttgccaaaaa tagaaacact cacgaaatgc tacatttttt tcccaaaaat aaaatgacaa 
172561 attgggatct ggcagcaaag tgcattcgtt ataattttca tgtttataaa atgacaaaaa 
172621 ttgttaatga aatgacaaat ttgttaataa aatgacaaac cctacacctt tcacgatgaa 
172681 ctgatctaat aactttaatt ttcaaggatc atatctactt gttagaacca atataatccc 
172741 atattgagta caacctgcag atttccaatc atcccaagtt cttgactagt cataggtaga 
172801 gccgtgctta taatctattg aaaaagatat ccgctatagt gctatttcac cccaaaaaat 
172861 agagttatcc ttaggttcac ccctagagtg aacatttagg ttcacccaac caataggaat 
172921 caagtatttc ataattaata ttttttttaa aaagaaaaga aaatattgtc aagttatatt 
172981 atgtttttaa aataaataaa atataaaaaa aaaataatag ccgttacaaa aaatgaattt 
173041 ttgaaaacta tttttaatat cgtcaaaaaa cactaaacct taaaccctaa atcctaaacc 
173101 ctaaaccctt gggtataccc taaacccttg gataatttta aactctaaac cctaaaccct 
173161 aaattctaaa ccctaaaccc taaatcctaa accctaaacc cttgggtata ccatagaccc 
173221 ttggataatt ttaaactcta aaccctaaac cctaaattct aaaccctaaa cccttggata 
173281 aatcataaac acttggataa tcctaaattc taaatcaaaa acactaaaca ctaaaacatt 
173341 aaatcttaaa aatactatta tggtttaatg tttttaattt agggtttagt atttatccaa 
173401 gggtttagga tttagagttt agagtttagt gttttgttga cgaaattaaa atctttttaa 
173461 aaaatctttt tttttgcata tattattatt tttatttttt aatattttta ttttaaaaat 
173521 gtaatataac tcgacaatat tttgtttact tttttaaaag atatcaactg tgaaatgagt 
173581 gatttctatt ggttggtgaa ccttaaggtt tactctaggg gtacaccaag attaagtcca 
173641 aaaaatatag gataatttgg tgcctccatg tcaagtatct agtgattttt atgtcttaag 
173701 aagttaagac tcaacctgag tttgaccaaa ttctctcgca taaacttctt atcttaaata 
173761 tttaaaatca tcaacaaaac actaaacata aactcctaaa ctctaaacca tgaatcctaa 
173821 atctggaatc cttgggtaaa tccggaaccc ttgggtaaat ccagaatccg aataaattat 
173881 acattttgac ccataaaaga aactttcata tttatgaaat ttacccaaat aaatgataaa 
173941 ttattcataa aattatataa atttatggat aattttcata aatctgggta aatttaataa 
174001 acttggaaac ctacatagat ctttcatttt ttgtcaaaat ggtaggaaga acaaatttgt 
174061 ctacatttat cttaaatatt aaaaatcatc aacaaaacac taaacctaaa atcctaaact 
174121 ctaaaccttg aatcctaaat ccggaaccct tgggtaaatc cagaaccatt gggtaaatcc 
174181 agaatctgaa acaatctgga acccttggat aaatccggaa ccctatgtct aacgtttaat 
174241 acaatctaat tcactataac tcaagaaata agtatacgta tacataagat gtacgaatat 
174301 ctaaggtgtt cgaatatata aggtgtacgt atacataagt atacataagg tgtacgaata 
174361 ccttaggtgt acaaatacct aaggtgtacg aatacctaag gtatacagaa ggtgtacgaa 
174421 tacataaggt gtacgaatat caacataact catctaatac ttaagatgta caaatacata 
174481 aggtgtacat atacataagt atacataagg tgtacaaata cataaaatat acgaatacca 
174541 acataaccca tcgaatactt aaggtgtacg aatacacaag tatatgtata cataagtata 
174601 catatattta aggtatacga atacatcggt gtacgaatac ataacatcta tgaatacctt 
174661 ttgtttatat cccttattgc attggagagt ttaaatacat attaagtata aaatttaaat 
174721 acttctacga atacaacgag agcacagtgc tctccaaaac aattatacgt taattattta 
174781 gaactcacat tttactctat ataaaattta aatattttgc taataaccta ttggtattat 
174841 aatctaaaat tttgtcaatc tcattataga tgatctaatt aactaaaagg tggtaaaaac 
174901 aatgaaataa tttataacaa catttcattt gtaaaagctg atataccaaa atattttcat 
174961 taaattcaat aaaataaatt aacatattag tttaattgtg ttttcatata tgtgaagcac 
175021 attcatttag acaacataat ttttcatata aaatttaaga ttgtttagaa atgaaatgtt 
175081 gaaataaata agtgaaatac atcttataat aaaaaaacta aacaattaaa tattttttat 
175141 aataaaaagc taaacaatta aaaaactaaa caattaaaga ttaaacaata aataattgac 
175201 attaaagaca tctaaaatga aatataatga aagatttctt taaaaaaaat acatcatatt 
175261 tttattatga aatatataat acagttacaa aaaaaatcaa taaaatcgaa ataaaatcta 
175321 gataattttt caatagaaaa atgacatttt ctattaattg tttagttttt ctcaataaaa 
175381 tcacagccat gaaaactaga attttagatg gatataagat gttttaatag aaaatagaca 
175441 ttatattatc ttgtattatt caaagatttt ttaaaaacta aattattaaa agataatgaa 
175501 atataattta aacaattaaa agataatgaa atattttttt aaaaaccaaa caataaaatt 
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175561 gaaatctaac ctagataatt tttcccatta gattactttt aagaaatcaa tttttaaatt 
175621 ttaataaaca tttttagtat ttatactttt aataattaat aaatgatata tagtattttt 
175681 gtatgatatt aagcattcat tttaatcatt tttagtgtat aaatattaca tttaggtgtc 
175741 atatgtataa atttcataaa caggggcttg gaatgtacat gagcgtaatt gtttcttatt 
175801 cttcagttga agaaaaaatg atttttcttc ttcagttaat ttaattgtaa aggtggtggt 
175861 gtagaaaaga atgagagaga aaaaagaaga tgagaggaaa agaaggagag ggagatgatt 
175921 ttcgtagatg atggttggtg gtgtggaaaa gaaggagaga gaaaaagaag gaaagagaaa 
175981 aagaatgaga gaggtggtgt gacaaattaa agaatgagag atgtgatgtg acaaatttta 
176041 ttttagatat attttaattg aaatggcaaa gatgtaaata atgaatctaa caaaggtccc 
176101 aaaggatatt tagacataag tttacatggg caaatgcaaa aagttgccca attggctaat 
176161 ttaaacttga ggtcgcccaa ttccctattt caaacttgtg atttgcccaa tttgccaatt 
176221 ttttcttctt ttattcatcc tccaactgat atcctgcaaa acaaacacac caaaccgttt 
176281 atgcaagctt ccctgatgct acagcaaaag ctgcagctct tctccagaaa cttaaaggta 
176341 agcaacgaaa tttgaaaaca aaattgttga atgacatacc cacactcatc tgcagatcct 
176401 caagcattgc cactgccctt tttagttcct ctttactcca taaaccagtc agcatattgc 
176461 ggatagtaat ggtatcagga tacacaccac ttgccatcat ctcctggaaa atatctagag 
176521 ccccattaat attacccact ttacgaaaac catgaatcaa agtgatgtaa gtaattgcgt 
176581 tagcaactat ccctcttcga cccatctcgc agaaaagctc cagcccatca tcaacccttc 
176641 ctgccttaca gtatccatta atgagtgtag taaaggtcac tacgtttgga gagaagctct 
176701 tgctacccat cgaatcaaac atttgtgtag cctcatctag gcggctttgc ttgcataaac 
176761 catggatcac tgagttatag gtgatagtat ctgggactat acctctgtgt ggcatctcct 
176821 cgtataattc ctcggcctct aaaaacttcc cttcattgat caagccactg atcaatatat 
176881 tgtaagtttg aacatcaggt tccacaccat taaaggggtg actagcatca atatccatct 
176941 tactcttctg catagcctta aacatttcca atgcatcttt tagtttccca ttatcgcaga 
177001 gaccgtccag caaagtgtta caagtaacga cattagggca cacaccacta gagaccatct 
177061 cctgtagaag gtcttgagca gcattaagat cgcccacctg acagaacccg tgaataagag 
177121 tggtgtaagt aattgtgtta gcaactaatc ctgcttcagt catctcatgg agaagtttta 
177181 ttccatcatc taccctctta gctctacagt atccggctat gagagtattg aaagtgatta 
177241 tgtccggaga gcagcccttg gtagccatca aataaaacat gtgctcagca gcatcaagac 
177301 gattctgttt gcaaaatcca tcgatcattg aactatatgt gattgtacta gggattatac 
177361 cccttggaag catctcatcg tataattctt cagcctcaaa gaatttgcct tccttgacaa 
177421 atgcattgat caaagcatta taagttacaa catcagggct gatcttcttc ctttctaaca 
177481 tttcttgcaa caactgctgg gcttcactcc atctaccaga gctacaaaat ccattaatca 
177541 tacagctgta ggtaaataaa ttgggaaaga ttcccttgtc ttgcatttca ctgaaaagat 
177601 tttgagcatc ggtatgacgt ccgtctttcc aaaggccaga ttaccacatt gggtttgatg 
177661 tggctcacct cctccatctt cctcagaaga ttcaatgcag acacagtgtc tcccatctta 
177721 cacatcccat ccacgattgt tccataagta atctggttag gctggagacc atcttctagc 
177781 atccgatcaa gcagagctac agcttcgaca actcgaccct cacggcaaag accgttcatc 
177841 agcgtggtga aggttacgac atttggttta cacatttgat gaaacaaatc caaggcttca 
177901 gagatcctgt cttccacaca taatccgtgg agcagggtgc tgaaggtaac aacagtggga 
177961 tgaaaaccaa gcttggtgat cttaccaaat gtagacaaag caaacggcag cttagagcag 
178021 ctgcagaaac acttcatcag gatggtgaag ctgtatgcgt tacatggaac ccgcctcatt 
178081 tccatcttcc tatggagaga aatcacaaca tcgagccttc ccatcctcac cacaactccc 
178141 atcaatttac agaaatcaat tactgaaggt aaaggacgag atcgtaccat atcaccgaac 
178201 aaatcaatcg catcttctaa ccctttgatt tcgtgaaatc cgcttcgcag cttcaaactc 
178261 tctcctccaa aaccactctc tccatccctg cttttcttgg ccagagcatg acgaatcgat 
178321 ctcgtacaga acaatctagc cgcagacaca gcaggagaag aagaagatcc ggatctataa 
178381 accctagcca acatttttgc ttcgcctgaa ctttgttctt cgtcgatctc taaaaccact 
178441 ctcgactgag tggctagtaa accggttcca tttgatttac aacaaccggt tcaatttgaa 
178501 ttatacaatt tccggttcag agttgaaccc actcagcgcg tcggtcccgt caagctaata 
178561 atttctttaa aaaaaaaaaa aattctctcc tctttgggca gttaacaata actaaactga 
178621 tcaaccaatg ttccaatgta acaaaaaaaa aacattatcc tataaaagta gttaaacatt 
178681 accctataaa agtaggttga aagattttca ccgaaatcaa gttaaataga acgataatac 
178741 cgactgatta agctggtctg acacattcta tgttatagaa ttttccaatt ctgactatga 
178801 gtaccaaaag agaataaacc aaaaaaaaaa gactttctta ttcgaagtaa ttgctggaat 
178861 gctggattag gtaaggtcac atcttccttt tgtaacttca ctctgactct cctacaaaga 
178921 catgttcaca aattacaaac caactcattc ttgttatgag aaatgaaaga cagcaaaatg 
178981 gagtttcaaa tgctttgcag caatcttatt aatctcaatg cactttaggt tttcacatgt 
179041 atcactgaaa ctgctttttc accaggtagc tacaattttt ctttgaatta ctaaccattt 
179101 cttggtattc ggcatcctaa ttctcatcaa caaagcaatc aacaaagagc aatcaacaag 
179161 agcagattaa tgccctgcta atacaaatac cagcagacca tcctgatacc ttaaaagtgc 
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179221 tcaataagtt attggcatta accaactacc taaactccca ttttgatgtt aaactaccta 
179281 atctgaaata ttcagaatga atttttggga atcctatttc cctcctgctc gaccctattt 
179341 gctcagagct cataattggc atcataatta tatccgcaat attttccagg ttacgtcaaa 
179401 tcaaccaagt tacaaagaaa atgatgaaca tattaaaact aataacaact ttaggaatct 
179461 atatttttat aagagctcag cttacttcag ttgagtactg atcttacgtc aacaagtgga 
179521 tcagatcttt tttgtcatag gctcatagca gacaaatcaa ttcacctcac aactcttctc 
179581 tcgagggtta agaaatcatg ttgaggaagc ttttgtccaa tctcccatca tataacatat 
179641 tagtgagtaa accaaaggtt gaagcatctt cagaaaaccc gcaacctctc atttctttga 
179701 tgagttctgc tgatacagaa atgtcaccat ttctaagatg tgctctgatt agcacattat 
179761 aggtacggtc atctcaaaga gacccatctt ctttcatttt tataaataag tcatccgctt 
179821 cctgtagtat agacgttttg tacccaatcc tgagattatc gtattgtatg ttataacatc 
179881 gggctgcaat cctttgaggc tgaggctatt gaataaatgt tgtacgtctt ccactttccc 
179941 agcctcgcac atcccttcaa tgataatgtt gtaagtgaag atataaactt ctatttgagg 
180001 ctgaggcctg aaaaaaattg gatattttca aatacttagg atattttgga taaaacatat 
180061 taggataatt ttggatgaat caaatatttt ataataattt agtttttcaa atattttaga 
180121 tatttttaat agatttttaa attttatata tattttttgt tatgtaatta tgttatatgt 
180181 atgtaatata tatatatata tatatatacc ctttcgatta tatattcaag tgttcgttcg 
180241 gtttcaattc tgttcggtta tttcgaatat ataaatttag caatcattcg aatatttaag 
180301 agttttagtc cagtttaatt atgggtattt tgatttggat ccgtccggtt cagttatagg 
180361 tatttctatt cagttccgat tcatttcttg ggttccaatt tttttgccca gaactactaa 
180421 ttagcaagac ataccaaatg aacacttatt atctatttcg ttactgaagc atctaagact 
180481 cactggaact tggctggaaa atatgtttac aaatgcgatc aaatatatat gcacaaattt 
180541 ttttaaaaaa aaactacaca aagacagcca tatacaacaa ctgacaacgt caaatattta 
180601 ttacaaaacc aataaagaga gacaataaac tatctctaag atcctgaaca taatttggga 
180661 ttctaaattc agattctata taactactag atcttaagga gaagccttga aagttagcta 
180721 agtgtgcctg attgcatcac agggaaggac caatcgctgc tgttcttggt cgaatctcta 
180781 tgctcatcat caccggtcaa atgtatctca tcggtggtgc cgaaatcgaa ttctttattg 
180841 gatccggagg aggaatgttg agctgcttct tgatcagacg cttcagggaa cgttgttctc 
180901 agcttcttat caacgcatct aatgaaatga tccgcatcaa gatcgaacga aaccggtttg 
180961 ttcggtctca caatcttgtt ctgctcttta ggagttgtaa cagacttgga gcaacgcagt 
181021 ttaccagggc tcagcagctt tggtggatca gagacttgaa actgagcgag tccgtcagga 
181081 aacggagaag tcgcaccaga accggacata accgagctgg gggagataag ctgagcgagt 
181141 ggactaccgg gaggaagttg gtaaaactga aactcgtagt tagacaccgg agacctgaca 
181201 ccgtagttac tgctggagtt aaagagctga gcgaaaggga cttcaggcga agacggtgtg 
181261 gtggtggtta agtagaaaga ctcgtcgaga ggcggcgtga ccggagctga agacggttcg 
181321 gttgtgtaag tagagaaaac cggaggagaa accagctgag gttcgtgagc gtaaggtccg 
181381 atggcgaaga tcgaaggacg ttcttcgctg ttgttgttgt tgttgtggct gttagaaggt 
181441 agaggactaa aggagaggat ccctacaggt gactgtgtag cggaaggagg ttctgattgg 
181501 aagaaggaag ctggagagga aggtggggct atgaaaggaa gagccgtcat aaccgaacga 
181561 taaccggaat tggatgtgga atcggtagga accggttcag gagcaagagc agctttcccg 
181621 attcgttttc ttctctgtgt tgaaggtctg aaacatatga aacgattcca ccatcttcgc 
181681 ttcttctgct attcaaattt tttttttttg attgaagtca cataaacaaa ttagaatcta 
181741 ttgtgaatga gtcaatagag aaacgtagct atggagttcg cctacaaact ttgttcgatc 
181801 taatgctttt gcgataacga aatcattata cagttctcta gatgatcact aaaacctcga 
181861 gaaaatgagc taaatccgat gaaataggga ctcacatgaa tcggggaagg ttggtgatga 
181921 acacgatcat cagaggaagc gaacgcagta gcggctgcgt ttatagtctc caaaacgttg 
181981 tttccactcg cgccgcctct catcttcctc tgatcttata taactttccg gagaaaacaa 
182041 aacagagcat tcgaatttct ctcaaagtat caatacacag aaaaaaaagg actatgaaaa 
182101 tcaatgtctc ctgtgagtaa agttgaaaaa caatttataa ataacgaaga cacgcaaccc 
182161 gaaagttctg ggaatattgt ttctctcttg agaaaatata tagatacaca aaccagcata 
182221 cgtgtgtgtt atatgtatgt gtatgtgtgt caaggtttga ggtcaaaaca cggcaaagaa 
182281 caaaaggcgt gcctactatt cattttgtaa attgatgtga tttgaagggt aattaggtct 
182341 attaaacttg cgaagaattt gacttctttc cttttttgtt tttttttgtt tttttttcaa 
182401 attaatttta agataaattt aaatttatca gtttttattt cattctgatt taaagatttt 
182461 cattttatcg tgaatatttc actacttttc tggtcactat ttcactaatt atagattagt 
182521 acaagaataa aatgagatta gcaaattgat ggtgctgttc gtttgctcat ctgggtgatc 
182581 catctaggtg aagatgcaag ttgatgtttg ttttgtacat taaaatgcta catccagatg 
182641 gatcacccaa ctgcatttat gaaaattcat ctcaaattct cacccaaatg agggtgagtc 
182701 ttcacggagc atctggatga aggtgagtct tcaccaaaaa tttgataaat ttttcgttaa 
182761 aacaaaataa attcttgcca aaaccgaaaa atgcaatttt ccgtcaaaat tagaaaacac 
182821 aatttctcgt caaaatctga aaaacataat ttcgccaaaa ccagaaaacg taatttcccg 
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182881 tcaaaatcga aaaatgcaat atcccgccaa aatcggaaaa aggcaatttc ccaccaaaaa 
182941 tcggaaaacc caatttctcg ccaagaccaa aaaacacact ttcacgccaa aaccgagaaa 
183001 acgcaatttt ccgtcaaaac cgaaaaacac aatttcccgc caaaaccgac aaacataatt 
183061 tcccaccaaa atcgaaaaac gcaatttccc gtccaaaccg gaaaaacaca atttctcgtc 
183121 aaaaccggaa aacgtaattt tcccgccaaa accggaaaaa cgtaattttc cgccaaaacc 
183181 ggaaaaacgt aatttctcgt gaaaaccgga aaacgcattt tcccgccaaa accggaaaaa 
183241 cgtaattttc cgccaaaacc ggaaaaacgt aatttctcgc caaaactgga aaacacaatt 
183301 tcccaccaaa accgagaaaa cgcaatttcc cgccaaaact gagaagacac aatttctcgt 
183361 gaaaaccgga aaaacgcaat ttctggccaa aaccgggaaa acgcaatgtc ccgccaaaac 
183421 cggaaaatgt aatttctcgc caaaaccgga aaaacacact ttcccaccaa aaccgagaaa 
183481 acgcaatttc tcgccaaaaa cgaaattttc cggcaaaacc gaaaatgtag tttcccgtca 
183541 aaattggaaa aacacaattt ttcgtcaaaa ccggaaaaat gcaatatccc gccaaaattt 
183601 taaaaacaat tctattttaa ttattttaat aacaagttca tctcgatgta gatgcaagtt 
183661 gaaaaaagca aacaaacata gttgcattta gatgattcat ctggatggat aaacgaaatg 
183721 cagagacgaa caacatctag atgaagtatt tggataaggc atcaagatag accatctgga 
183781 tgcatttttg agatgtacaa acgaacaggg ccatattctt acgaacataa agattgagat 
183841 gtttttgttt tattttttat ttttcatgac gagataagtg tcacagaaac gcacgtgtgg 
183901 gagaatcgaa tctatctctt ttagttttag gtttctcgtg tgcacattag tgtggggcta 
183961 aactaagtaa tttaaaatta ttaaataaat tatatatttt gaattagaaa tgatgtcaca 
184021 acttctggcc cctaaaaaca ttattatctc acaaagccaa tttcaacttg ttttgcacgg 
184081 tggcttgact tcatgttttg ggtcttgttt acttttctat ctctccatac ccaataatta 
184141 aacgaatttc ctataaatgc tttatggaaa ccataattca atcttttggc acattataat 
184201 ttaatc^tta aatagcccgc agaagctgat agccactttg taacttgtta gaaaatggat 
184261 tactgtttgt atttgtccaa ttgactgatt cttggttcca aaaaggggac ataacttatt 
.184321 gaatcattta cacaaaaacg gaagacaggc acagttaatg tgatttggtg caaaaatgtt 
184381 tagctgtctg tcaatcaaat aaatacgcag atctttccag attcttgaaa tttttcatat 
184441 tagttgggaa attcatttta atatcccaaa aatatagaat ttgaaagagt tttctgttaa 
184501 attataatag gtgagggtta taaaaacata attactactt aacattacaa taaatagata 
184561 acataaaaat gaagaaaaag atcaaaagaa ttaaaaagac gtttttaaat aagatttttt 
184621 taaaaagttt aagacattaa aaaattgttt tatcattaat ctaatttttg tatttaaatt 
184681 ttaattttaa tgtggagcta cattaatatt aatattatgt tatcaaatat ttttttagga 
184741 tcctttagta ttgatgttgt tagaattaag tctaaaaagt taactatgaa aaagatagaa 
184801 aaactagaaa atgatgactt ttactaccaa gtttaagaat acactaggtt ttggtccgca 
184861 ctttaaaagt gcgagacttt tttgttataa tttagtacaa aaaattcaaa gatgatattt 
184921 ataaaaacat tttattatta tattgaaact ctgatatgga ttgtcgatat caaaatcaaa 
184981 ataagttttt cctaccatat atcttagggg tgggcacttc gattatcttc tcggttcgat 
185041 tcaggatcgg gttggtttgt ttagtttggt tctagtaatt tctaactgaa gtaaatcata 
185101 gttagtttgg ttcggtttct cggttcggtt tagtttcaaa ccgaaccatt cgggttggaa 
185161 aatcttcaat cgaattattt gaaaaagatt tcggttcgat ttgaatcggt atcgggtcga 
185221 ttagttccgt ttgactcggt tcggtttgaa ttttttgtcc acccctgata tatctaaacc 
185281 gtgcatttct tgaaatatct tatcgtggta accgatgatc catactttaa cagcgaggtt 
185341 attttgtttg tacattttaa tttgaaaata taaatatttt gatccgtgtt tctgaagcca 
185401 agtaatttat ggtttgaaat tatagtaaat ttacattttt ctattttatc atatataatt 
185461 tatgttaaga ttcacataga tattatattt taattatttt tcagtgagtt caaactttgt 
185521 atctttaata aaatttgaat atgcgataaa ttatcatttt ttccagctta aaggtggatg 
185581 agtaatcatt tattagaaat gttattttct gtcctattat tattattata aaatattgct 
185641 ttcaatttga ttttaaaact agaaatggaa cacttatttt ggaatttgta aaaataaata 
185701 taatagtata atatatagta tcgaataatt tcaaagttat tttcatataa taaaaatgta 
185761 tatgtatcgt tcttttgttt gaatttaact aattaggtat gtgcagtaaa gtgccatcac 
185821 taagttatgt tataatcata taaactccac ctatctatgt tagcaatgat gataatatat 
185881 attgaaacaa aaaaaataaa aatgtaaatt tacataggaa aaaaacgtat atgtactatc 
185941 acatatacaa ataaaaatgt gactatttaa ttcaaattct atgtgaaact atattaggct 
186001 gaatattttt tgttcagatt ttctaaagat ctcttcaaaa tattttaggt aatttaaata 
186061 ttaaatattt aatatatttc agtggcattt ctgtaaatag tttgaaaaac taaaggaatg 
186121 tttgaaaaat atatgctgtt ttaattgtat tgatttgtca tgttattttt ggtagtcatt 
186181 ggtaaacttt ttacttggtt actctctccg tcttcaaaaa aaaattataa agaaaatata 
186241 actaatattc ataattaaat ttattattta tttaaaaata tactttctga taactatcag 
186301 ctaacaatat ttaattaatt taaatattct caattaatgt ttttaaaaaa tatacaaaat 
186361 agctttaaaa tatataaaaa gtctattttt gtgaaacaaa aaaaaatcta gaaaaattta 
186421 ttttcaggga atagttttta ggaaaaaaca ctttctagaa aactatagca acttcaatag 
186481 ttttctggct taatactttt aatataatgt tgtctgaaac taaaaatcaa accttactga 



140 



WO 03/006622 



PCT/US02/22217 



186541 aattataata agctgaaaca gtttacaaac caaaaaatca acatttagtt ctcatgtaaa 
186601 agtaaatttc cggcctaaga aactgattta tccaaaagta attctatact actcccacca 
186661 taataatata tgtcatattt ggtttcaata tttacaaacc tgtaactttt ttaagtaact 
186721 tggacaatag agagatgtaa gcacatatat ttttattgaa atgatctttt gttaaatatt 
186781 ttaccatcta tttataaaca aatgtaaagt tatttagttc gtagactcgt agttatacca 
186841 ggaaaatgaa tttgagcttt gtgaattaga tttatgtcgt tatctgaaat gaaatagaaa 
186901 gaatatcatc gttaaagttg agcacattag ttagcaaaag aaaaaactga gcacattatt 
186961 gtccattttt ttcttttaag acttgtcggc agttcaatga gttaggtcat caaccgcact 
187021 tccggactcg tgctttgaca catgtcacca acccaccttg cactcaaaga cacaacacag 
187081 tttaaaaata agagatcatg aaactgggat acaattatac tcttattcaa ttttgataat 
187141 tgtactctta ttcaatattg acaacaatta gtttcaaaaa aaaaatttga caacaatttt 
187201 ttttttgcaa tgagataata agatgacacc agctttcacg tgcatcaaag ttggatgaaa 
187261 aaggacaaga agtttacgtg atatttcttc caaagtaaaa gccacaccaa aataaaagtt 
187321 aagaaccata aaagtcaaat taaaataata tactccaaga ttcgaactta tcattaaaag 
187381 ttcctttact gcaaatttac aaagcaagca acagcagatt caataaaggt tcaaatttaa 
187441 tggcagggca tcaggtttgg tgtagaaaat catttggcaa gatagactag aaatgataag 
187501 aacagaacct gaatttcaaa gagaaggtac gagcttcagc attgtaagag gatgtcactt 
187561 tgacgactgg cgtcccagct tgagggtacc taaaatatca atattgtata tccttattag 
187621 aatgaaaacg ataatattca caagacattc cacagtagaa atatgactaa ccattgcaag 
187681 aaattagcaa aatctgcatt gtttgcatca cgcatagcag caaagaagtc ttcacaggtc 
187741 acagcttgct catcgtgtct ttgaaaataa agatcaattc cctgtaacat atgcacacac 
187801 agaagtgagc gtaaggacca cgaaataaga cctagatact ggatacacaa aaaagagtat 
187861 tggaaatcaa cctttcggaa accctcactt cctagtagag ttttgtacat cctcacaacc 
187921 tcagctccct aagaaaacat tatatagtca ccttcagaga ctcggtatat ttggctgtgt 
187981 cattacatat caatttatgg cgcaacctta tctgggaaag ttggggctag tgcgaagtga 
188041 cataggtgtt gatagaattc cagactgcat taaagatctc caacggttaa aaaatcaaag 
188101 tatattttgc tgtccaaaac ttgcataact gcccgagctt cctaggtcgc tcacaacact 
188161 aagagtatat agatgtgaat cactggagac aatatagtac cgttcccttt aggttctgag 
188221 attgaggata tctatttccc tgactgctac agattgggtc gtgaagcaag gagagtaatt 
188281 acccagcatc cgttgcagga atgcctacct ggcatacaca tatacctgca gggttccagt 
188341 acctacctat ctgttcaaaa gcctacctat ttaagttttg tgtggtggtt ttccctaaac 
188401 atggaaatgg tagaacatag tggtattgaa ttactgtgtt acatacgcat gaatggttgc 
188461 cctacgaaga ggagaacatt ttcttttgat cttccaatcc aatcagaaca tctgtttata 
188521 tttcacgcca tagtgtttaa ggaagaagac ctacagcatg aacaatatcg ccagatatgt 
188581 ttcctgttca gcaccacatc ccaggaagtc gaaattattg aatgtggcgt ccagatcgtg 
188641 agggacagaa gtggcagaaa aagaagcgac gccgatgctg tgaatataaa tcacaacaac 
188701 aactgttgga agacgacgat gaaagtctct acggtagcct tgactatata tgatgcacca 
188761 agagtagcta ccattaagga ttttgctaag tctctttctt cactttttat tttcctattg 
188821 agtttgattc cgagtgttcc tttttatagc ttactaaagg aacataaaga cttgataagt 
188881 gattatttca catcattctt ttccttctga gttcgatttg aagtgttact tttgtgtaac 
188941 ttccttttaa gacttgggta tgtgatcttg tcgttgtatt tactcatcag acaatgttca 
189001 gtagttgtgt atagacttca gctacctact cgttttgcat tttgttaact tttcaaagaa 
189061 attctacgtc tgctgggaag atgttgacgt tgcccaaact gctcttgttt agctgttgag 
189121 attttcaagt gttcttttga atagctttta ttgttttaca ccttgactac agttattagg 
189181 aagtatacct gggaaagtgt cttcttcctc gtatatttag ggtcctaata tgttgtgggg 
189241 ctccatttag aacaaaacaa tagctgttct gttttaatgg tttatgaaaa tgcagaaacg 
189301 tctcaatgga atgatatgat catatgttgt ggacaggcac atacagagta catttatatg 
189361 aatcatgggc ttgttggaag aactagactt cattagaaag agatgaagtt tccttcaaaa 
189421 gaagtctcgc ctgaacaaga agaaagcaga acaagccaaa agcgacaaca acaaggatcc 
189481 ataaaaagtt ccagtttcag gtaataactt gcgtagaata agtgacaaaa tatttagttt 
189541 gaatatggca cacaagatta gacaagaaga cgttgacgaa aaaaaaaaaa aaaacaagaa 
189601 gaccataacc ggttgaatgc cggtttaaaa cttaacatct aaccggtaac cggactagcc 
189661 attctcattt gaaaacaact aataagcgaa gaaaattcag ctcaagtcaa cacactgagt 
189721 gaaaggactt taccatcttc acatctggct catcgggaga gactttcatc gatactctgc 
189781 tctcatggct tcttcttcct cttcttcctc gcctcgcaca tggagatacc gcgtcttcac 
189841 gagcttccac ggacctgacg tccgcaaaac cttcctcact cacctacgca agcagtttaa 
189901 ctgcaacggg atatcgatgt tcgacgatca agggatcgag agaggccaca ccatcgcccc 
189961 tgctctcaca caagcgatca gagaatcgag gatctctatc gtggtgttaa cgaagcacta 
190021 tgcttcttcc aggtggtgtt tggatgagct tttggggatt ctgaaatgca aggaagagat 
190081 cgggcagata gtgatgacca tcttctacgg agtagatcct tccgatgttc ggaaacaaac 
190141 cggagatttc gggaaagtct tcaaggacac atgccgtcgt aaaacagagg aagagaggcg 
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190201 aagatggagc caagctttga ccgatgtggg aaacatagct ggggaacact ttctcaactg 
190261 gtttgtttgc tttttcttca ctcttgtacc tttcttgatt ctcacatatg aacgtaatct 
190321 cttgttttag ggacaaggaa tcggagatga ttgaaaagat tgcgagagat gtctcaaaca 
190381 agcttaatgc tacaatctct agggattttg aagacatggt tggtattgaa gcacacttgg 
190441 ataagatgca gtctttgtta catttagatg atgaggatgg agctatgttt gctggaatct 
190501 gtggccctgc tggcattggt aagactacca ttgctagggc tctacatagt cgactctcta 
190561. gcagttttca tcttacttgt tttatggaga atcttcgagg aagctgtaat agtggtctcg 
190621 acgagtatgg attgaaactg cgtttacaag agctacttct ttcaaagatt tttaaccaga 
190681 atgatatgag gatataccat ttaggtgcga taccgcaaag aatgtgtgac caaaaagttc 
190741 ttatcattct tgatgatgtg gacgatctgc agcagcttga ggctctggct gatgaaacta 
190801 actggtttgg tgatggaagc aggattgtgg tgaccacgga agatcaagag cttttggagc 
190861 aacatggtat caacaataca tactatgtgg atcttccgac tgacgacgag gctcgtaaga 
190921 ttttttgtag atatgctttc agacggagct taacaccata tggttttgaa actcttgtcg 
190981 aaagaacaac agagctttgt ggcaaacttc cttttggtct ccgtgttcaa ttttacgcgg 
191041 aaagaaagaa gacgactggg aaagtatatt gcaaaggcta gaaaatagca atataccaaa 
191101 gatcgatgca gtacttagag ttggatacga cagtttacat gagaacgaac aaactctgtt 
191161 tctcctcatt gccatcttct tcaactacca agacgatggt cacgtgaaaa caatgctcgc 
191221 tgacactaac ttggatgtca gactcggctt gaaaactctc gcttataagt ctctcacaaa 
191281 aatatctagc caaggaaaaa tagtgatgca caagttacta caacaagtgg gcagacaagc 
191341 agttcaaaga caagagcctt ggaaacgtcg gatcttaatt gatcctcaag agatctgcga 
191401 tgttcttgag ccttggaaac gccaagtcct aactgatacc gatgagattc gcgatgtcct 
191461 tgaaaatgat tctgtacgtt cattttcttc tatctgttac accggttcat ttagcaaagg 
191521 cctatagatg ttaccacctt ttgttctaac ataatttcct ttattttgaa tttttttttt 
191581 agggtagtag aaatttgatg ggggtatctt ttgatatgtc tacaatctta cacgacatgg 
191641 atattagcgc aagagctttt acaagtatgc gtaatcttcg atttctcaag gtctacaaaa 
191701 caagatgtga tacaaatgtt agagtgcatt tacccgagga catggagttt ccacctcgtc 
191761 tgaggttatt acactgggag gtatacccga gaaagtttct tcctcgtaca ttttgtactg 
191821 aacatcttgt ggaactctat ttaagagata ccgagctcga gcaattatgg gagggaaccc 
191881 aggttggtta ttttatttta ttttctgttt gtgtgtactt tggatagtgt ttaatggttt 
191941 ttactggttt gtgaaatttt atataaatct aaatggaata tagaaaacaa tgaaccttaa 
192001 aaaccaaata tgtggaagtt acagtagcat cgtcaatttt gttaagagtt tcaaacatat 
192061 gtatgattcc catcatttca taacattaat atggtattgt gctgttttac attcagcccc 
192121 tcacaaatct caagaagatg tttttgggtt cgtgcctgta tcttaaggaa cttccggatc 
192181 ttgcaaaagc tacaaaccta gagaaattga ggttggatcg gtgcaggagt ttggtagaga 
192241 ttcattcctc tgttggaaac cttcataaac tagagagttt ggaagtggct ttctgttata 
192301 atctacaggt tgttccgaat cttttcaact tggcatctct tgaatcattc atgatggtgg 
192361 gatgctacca actgaggagt cttccagata tttctacgac catcacagaa ctctcaatcc 
192421 cagacacact gttagaagag tttactgaac caattaggct ctggtctcac cttcagagac 
192481 tcgatatata tggctgtggg gaaaatttgg agcaagtgcg aagtgacata gctgttgaga 
192541 gaattccaga ctgcatcaaa gatctccaac ggttagaaga actaactata ttttgctgtc 
192601 caaaacttgt atcactgcca gagctcccta ggtcgctcac attactaata gtatacgaat 
192661 gtgattcact ggagacacta gcacctttcc ctttaggttc tgagattgaa gctctctctt 
192721 ttcccgaatg cttcagattg gatcgagaag caaggagagt aattacccag ctgcaatcat 
192781 catgggtatg cctacctgga agaaatatac ctgcggagtt ccatcaccgg gttataggaa 
192841 atttcttggc catatgctca aatgcatacc gatttaagct ttgtgccgtg gtttccccta 
192901 aacaggtgat ggtggaagat gaagatatag aattactgtg tcacatactc ataaatggtt 
192961 gccccatgaa gagccccatt aagagcatat ataatcttag gattagaatc caatcagaac 
193021 atctgtttat atttccctcc acaatgctca aggaagaccg acagcttgga caatacagtg 
193081 agatattgtt caaatttagc accacatctc agaacactga aattattaaa tgtggtgtcc 
193141 aaatcttgag ggacagaaga agctgtgatt ctaagtcaga acaagacgac gatgaaagtc 
193201 tctacagcag ccttgactat gatgcaccaa gagtagatac cattaagtat tttgctaagc 
193261 ttctctcttc actttttatt ttccttttaa gtttgattcc gagtgttctt ttttacagct 
193321 tactaaagga acataaagac ttgataagtg attatttcac atcattcttt tccttgtgag 
193381 ttcaatttca ggtggttcct ttgtataact tccttttaag actggggtat gtgatcatgt. 
193441 cctagtcatc tttgtatttt acatttttac ccgtcaagac aatgttcatt agttgtgtat 
UllV: ?? acttca 9 c tatctagtcg ttttttaatt ttgttcacgt tgttgacaag atgttgaagt 
193561 tttcaagtgc tcttttgtat agcttttaca cattgactac ggttattact tattaggaag 
tataccca 99 aaagtgtctt cttcctcgta tatgttgtgg gactccattt agaacagaac 
illfE* a *tagctgtg tttttttttg ttctttgctt gacttgggaa tggtttttga aacagctaag 
193741 cgacgaacat cttaaaagtc aacccacgtg acgtcttctt cttcacatat ggctcatccg 
193801 tagagacttt tttttttctt tgacatcgta gagactttca tcgctaactc taattctcat 
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193861 ggcttctaaa tcttcttcct cttcctcgtc 
193921 cttccacgga cctgacgtcc gcaaaacctt 
193981 caacgggatt tcgatgttcg acgatcaagc 
194041 tctcgcacaa gcgatcagag aatcaaggat 
194101 tccaggtggt gtttggatga gcttttggga 
194161 atagtgatga ccatcttcta tggagtagat 
194221 ttcgggaaag tcttcaagga aacatgccgt 
194281 agccaagctt tgaccgatgt gggaaacatt 
194341 tgctttttct tcactctttg tatctttctt 
194401 ttagggacaa ggaatcggag atgattgaaa 
194461 atgctaccat ctctagggac tttgaagaca 
194521 tgaactcttt gttacattta gatgatgagg 
194581 ctgcaggcat tggcaagact accattgcta 
194641 tccagcatac ctgttttatg gagaacctta 
194701 atggattgaa gttgcgttta caagagctac 
194761 taaaactatt tcatttaggt gccataaagg 
194821 ttcttgatga tgtggacgat ctgcagcagc 
194881 ttggtgatgg aagcaggatt atcgtaacca 
194941 gtatcagcaa tacataccgt gtggatttcc 
195001 gtagatttgc ttttagacag ctctctgcac 
195061 taataaagct ttgcagcaac cttcctttgg 
195121 gaaagaaagt agacgactgg gaaggtatac 
195181 agattgatgc agtacttaga gtcggatata 
195241 ttctcctcat tgcatgcttc ttcaactaca 
195301 ttgatagtaa cttggatgtt agactcggct 
195361 agatatccgc agaaggaacc atagtaatgc 
195421 cggttcattt gcaagatcct aggaaacgcc 
195481 atgtcctcga aaatgattct gtaagttgtt 
195541 atgctaatat tttattttat ttggaaatat 
195601 gtatatcctt tgatacatcc acaatcccaa 
195661 gacgaatgcg tgatcttcgg tttctcagca 
195721 gagtgcattt acctgaggac atgagttttc 
195781 tatatccagg aaagtgtctt cctcatactc 
195841 ttgttaacag catgctcgag cagctttggc 
195901 acacactggt ttaaagttta gaaatctata 
195961 aaagtacttt tgatattttt ttaaaaacat 
196021 tgtttgaaaa aatggatagc aatatattaa 
196081 aaaattagaa gtctattata taatcataaa 
196141 aataaaaata acttaaaaat gatttatatc 
196201 aaaatttgat ttttactaaa atattttcca 
196261 aaaatgtaca atacaaagcc aaattttaaa 
196321 acattaagtt tgaaaatata attcacattt 
196381 ataaaattta tttatataca tatttcaaat 
196441 tcaaaaattg attcaaaaat atacatatgc 
196501 caataaccac tataaaaaaa aatattttct 
196561 aaactctaat ttcaaacacc aacttaaatt 
196621 tatatgtgat agtttatata ataatacgta 
196681 aatagtgaca tagaaaacaa atagtaattt 
196741 gtaaacaaag tctagttatc attatatatt 
196801 taagctttgg caaggagtgc aggtttgtta 
196861 ttagaaattt tatcattgta tattgattgt 
196921 agatggactt gtccgggtca ctgagtttga 
196981 gtctcaagag attaaatcta acaggttgct 
197041 gagaccttca taaactggag gagctggaga 
197101 cgactctact caacttggca tctcttgaat 
197161 gcaaaattcc tgatcttcca accaacatca 
197221 aagagtttcc tgaatcagtc aggctttggt 
197281 gtgtcctcac agtgccactt ttggaaacca 
197341 cgattgagag gattccagat tggatcaaag 
197401 ctggctgcac gaaacttgga tcactgccag 
197461 tagacaactg tgagtcacta gagacggtct 



tcgcacatgg agataccgcg tcttcgcaag 
cctcactcat ctacgcaagc agttcaccaa 
gatcgagaga ggccacacca ttgctccttc 
ctctatcgtt gtgtttagaa ctatgcttct 
attctgaaat gcaaggaaga gatggggcag 
ccttctgatg ttcggaaaca aaccggagat 
cgcaaaacag aggaagagag gcgaagatgg 
gccggggaac actttctcaa ctggtttgtt 
gattctcata tatgaactta atctcttgtt 
agattgctag agatgtctca aacaaactta 
tggttggtat tgaagcacac ttggatgaga 
atggagctat gtttgttgga atctgtggcc 
gggctttaca tagccgactc tccagcactt 
gaggaagctg taacagtggt actgacgagt 
ttctttccaa gatttttaac caaaatggtg 
aaaggttatg cgacctaaaa gttcttatcg 
ttgaggcttt ggctgacgat actaactggt 
cggaagatca agagattttg gagcaacatg 
caactcaagt agatgctcgt cagatctttt 
cccatggttt tgaaaaactt gttgacagag 
gtctccgtgt catgggctcg tctttacgca 
tgcagagact agagaatagc tttgatcaaa 
acagtttgca taaggatgac cagttcctgt 
aagacgatga tcacgtgaaa gcaatgctcg 
tgaaaaatct cgtgtataaa tctctcatac 
acaagttatt acaacaagtg ggtagagaag 
aaatcctaat agattctcac cagatttgtg 
ttttttctcc tttcttcttt ttatatgatt 
aatgatttta ggatggtaca agtgtgatgg 
acggagtgta tataagcgcg caagggttta 
tctacgagac aagacgtgat cctaatgtta 
cacctcttct aaggttatta cactgggagg 
ttaggcccga acatcttgtg gaactctgtt 
aaggagtcca ggtttgttac ttatatttga 
taactaaatg tgaagtaatt ttgaggaaaa 
aaacaataat ataaaaaatg agtatcatgt 
aaaaaaataa taatttactg ttaattaaga 
acaagattta aactttatat acatatttaa 
gaaaattgat tcaaaaattt acatatattt 
ctaaccttat aaaatatttg gaatgtatat 
taccaagtta aattatggtt tttataattc 
aaaaaatatt ttttctaaaa taatcatcaa 
aaaaataata atttataaaa ttgatttata 
tcaaaaattg atttttacta aaatattttt 
atatatatat atatatatat aaatacaata 
atggtatttc tatttctttt aaaaatataa 
taaaaactat tatacaatga cataatacaa 
ttctttgaaa aacaacatcc gcgcgggtgc 
tgataaaatt tacaagtttt atattttatc 
ttacgtttga acaccgtctg gtttttaaat 
tttacattca gcctcttaca aatctcaaga 
aggaagtccc tgatctttca aatgctacaa 
ggagtttggt agagattcct tcctctattg 
tgaatttatg tgtaagtgta caggtttttc 
cactcaggat ggtaggatgc tggcaactaa 
aatcacttgt agtcggggaa acaatgctac 
ctcaccttca tagtctcaac atatatggca 
cgtcgcaaga attctccctc gctgctgcta 
attttaatgg gctaaggttt ctttacatag 
agctccctcc ctcgcttaga aaactaatag 
gttttccttg cgacactcca acaactgatt 
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cccattagct 

caacagcttc 

tgattttgtt 

gggtttcatc 

ccacacagga 

gttaatacgg 

agaacttgta 

caatgagcga 

aagaaagttg 

gatcaaaagc 

atgatgcttg 

agtatggcat 

accaagtaat 

cccggctttt 

ttgtattcac 

ggcattattt 

tcgtttaact 

ctagccatat 

acaaaaatat 

aaaaaaacag 

tgtagaaaga 

ctttctaaca 

cataatctct 

attaagaaaa 

acgtggtcca 

tgccgagacc 

aaactaagtt 

agaataaatg 

gcacataccg 

agagatcagt 

tttctgatat 

tccggaaaga 

tacatacatt 

ttggtcactc 

acagaggaaa 

aatgttggtt 

tcaattacaa 

cagggtatca 

agaacctgaa 

cgactggcgt 



ccccaactgc 
agcttacttc 
aagttccttg 
tcctgcaccg 
ttgtcccagt 
ttttatattt 
ctcatcccac 
cagaagaagc 
ggatgatgat 
aatgtttctt 
tagattccta 
atctacccat 
atggattcat 
tgctcttgtt 
atctggaaaa 
ttgaaaaata 
ggtgcaaata 
taaaagaaaa 
acgttcagga 
tgataattcg 
tctcaagcag 
gacattatca 
attcatccag 
gtgtgtgtga 
gtagcgcttc 
aagaaaaaaa 
gagtcaaacc 
acctgatcca 
gtaaacctga 
cggaacgaat 
tgggtgatcc 
aaggaggaac 
attttaaaca 
cgccctgcca 
gtgcaagagc 
ccaatgcaag 
caacaaagca 
taataatgac 
tggggtggtc 
aacaacagtt 
gaacttccta 
agaagacata 
aactttctaa 
ccctccccag 
agataaaaga 
taagggaatg 
ctgaatgaac 
cagcatgagc 
aatgaacgaa 
attgaattca 
ctcaacacga 
atcagagaac 
aattgaaaac 
acaaaaggtc 
gaaggattgt 
gtgtaatgtc 
cttttgttgg 
agcaagccaa 
agtttggtgt 
tttcaaagag 
tccagcttga 



ttcatgttgt 
ccaggaaaag 
accatcatcc 
gatatggaag 
gatgatgaca 
cacattgagt 
gaagtcgacg 
aatgaatctt 
aagagcaatg 
tctttagttt 
aagaactaaa 
tcgatagcgg 
ggatacagtt 
gtctcgagtt 
ctatgttaga 
acacatactg 
gtaaaataag 
acatcatttc 
taagtttatt 
gagctacgag 
ccaagctctt 
tctccaactg 
acatcttctc 
gagaaaaaga 
cacctcgaaa 
aagaaacaag 
tgaggattga 
tcttttgcat 
atggtcgtcg 
aagagttttg 
aaaagcttct 
catttattaa 
tggaggcaga 
cttgttatag 
tgccaaagca 
ttccacgtat 
aacacacaaa 
tcgcaaatgc 
aaagacataa 
tgaaaagttt 
acttcagccg 
ctatctttag 
cggcatgaac 
gtaatgttat 
ttaagaagtc 
tatagtcgta 
tttgggttta 
ttccttgcca 
ctgcgcgtaa 
tctgaatcat 
actggggcac 
acaaactctt 
actggacgtg 
acgtaacgta 
gctgctgctt 
ttttccactt 
ctggcccgga 
tgtgtcaaca 
agaaggtcat 
aaggtacgag 
gagtacctaa 



gccaagaagc 

agatgcctgc 

gtccggctat 

aagcttattt 

tgctttcgtt 

ttgttgaaca 

ttattgaatg 

gttcagaaca 

agatttatga 

ttacgttcct 

acttgagtgt 

tcaccagtag 

ggaggctaga 

tcttattgat 

tcgtaagttg 

tgtctacttt 

taaataacta 

gttagagaaa 

attgtaaagt 

tagaaatgtc 

agaggcaatc 

tgcctgtcaa 

ttaggattat 

gagacttagt 

acgcagatac 

ttgtaaaggt 

ttttgtctaa 

ggaaattcac 

cattgtttca 

agaattcact 

tgacattctc 

caacctgaga 

gggaagtaac 

aaatcagcaa 

gcaatttggt 

gctggatcct 

gaagttagtg 

caacctagag 

gcctcagtgc 

cagtgaataa 

acactgactt 

aagctcagtt 

agcatcagga 

tgccttggct 

gtatcctatg 

ccttgtccaa 

gaaccaatgg 

gaacttgacc 

atgaaaaaag 

gggctaagag 

tgaatcctct 

cttctttctg 

gaatagagtg 

ttgctttaag 

gaaatggtct 

gagtccagaa 

gtcgggggta 

gcagattcaa 

ttggcaaaac 

cttcagcatt 

aatatcaata 



aaagagagta 

tgcagagttc 

ctgcaagttt 

caaattactg 

agatctcgct 

tcacgaggag 

cggtgtacag 

agtgtctgaa 

acccagagta 

tttgagtttg 

atgatattgt 

taatttcacc 

tagatggtga 

tgaaaatcta 

ataatgaaag 

gcatggactt 

tgggtgaatg 

aagtacataa 

gcatgcgttg 

tttaaaaagg 

tcaaatacat 

accaccacaa 

aaaaccaaca 

taccttggcc 

catacgagaa 

tgtttatttt 

ctggacaaca 

tggcgaaccg 

tcgttagcat 

tgccttgttt 

cacattcccg 

tcagaagcca 

gtgtgacatt 

gaacctcatc 

ctgtcaaatt 

caagcgatgc 

acttccgcac 

cagtgttctt 

tcctattgtt 

acagcgagtt 

gggtctcaaa 

ttaagttcag 

tccgccacgg 

ataaattcct 

ctacgcttcc 

gctcgagtca 

tttgttttgt 

tgcctcccac 

atctaaaatg 

gaagaacaga 

aaacagagac 

tacgtaaaca 

gagggcgatg 

aaaggaattc 

gaagtgtacc 

gaccaacaac 

tctcctgact 

taaaggttca 

agactagaaa 

gtaagaggat 

ttgtatatcc 



atcacacagc 
gatgatcatc 
aggatttgcc 
tttcgcatac 
aaaatccaag 
atggtgttca 
gtcttgacag 
gacggggatg 
aagatattta 
atttcaagtc 
ttgtgttgga 
cgtttaaaat 
ctttaccatt 
attccccttg 
gatccatcaa 
cctcattttg 
ttcattcttt 
cataaaaaga 
aatgtaaaaa 
atttgattct 
tttcagataa 
aacacaaaca 
aaaatgctaa 
agagcttgtc 
gcaacctaga 
tgtcaagaaa 
atgtcaccca 
cagaaccctc 
acatctgaga 
ggattgcgca 
ggaatgtcgg 
gataaagctc 
gaagatctat 
acgggtttga 
ggtggccgac 
aagataagct 
aaagagagac 
caaagcacgc 
ctcaacctgc 
caaacgaaat 
agatttacta 
atgcaagctg 
ccatcatgtc 
acaccaatag 
ttgtttgact 
gaaagcacgc 
tggaaatcag 
ctaaaaatac 
ttgtggaaag 
tcatcatcag 
ggcacaggtt 
aaacataagg 
taacgtagac 
tgaactaacc 
attatggtaa 
cactggaata 
ggaaaagtta 
attgtaacgg 
tgttaagaac 
gccactttga 
ttattattag 
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201181 gatgaaaaca ataatcttca caagacattc acggttgaaa tatgacttac cattgcaaga 
201241 aattagcaaa atctgcattg tttgcatcac gcatagcagc aaagaagtct tcacaggtca 
201301 cagcttgctc atcatgtctt tgaaaataaa gatcaatacc ctgtaacata tgcacaaaca 
201361 gaggtgagcg taaggaaaac gaaataagac ctagatactg gatacacaag aaaaaagagt 
201421 attggaaagc aacctttcgg aaaccctcac ttcctagtag agttttgtac atcctcacaa 
201481 cctcagctcc ctaagaaaac attatatagt cagctaaggt tttttgctta ctgcatataa 
201541 cagcaacatt atcatcgaga caaacctttt cataaacctg aaacgaaaac aaaaaccaaa 
201601 tcattctgtt agaatcaata cataagggac tgataggaaa tgaaactaaa tagagaaaaa 
201661 taatcttaga ttccctaccg tcactgatag agcagggtga tagccaggca cgtaatccaa 
201721 gattggaaca gttgcaaaaa acacaagcac gttagattgg aacccaatac aaatagtaac 
201781 cagatgaaag cctataacag aatcccagac ttcttagcaa acagaaactt gcctgtgtag 
201841 aagttgtcca tctgcacaga aagttaaaag agcaatgttt atcaacacga acgtaacatg 
201901 cacaacatag tttcaggaaa agatatgtaa caagactctg accttgatgt atgaatgtgg 
201961 gcgaacagga tgtgccatag gaccagcatc ctggataatt aaacaacaga ggattggctt 
202021 agaaaaaacg agataaaccc atgtagtgac ctaaaggagc acaagactag gtacctgcgg 
202081 gaattgatag atcctgagct ttgaaacatc agcaatacgc tttacagtac ggcttcccat 
202141 atcagatgaa aactcctgta aggaattgga gaaaaataat cattagctag agaaaaattt 
202201 acaacacgaa tcttcatatt atgaaggata atttatgtac ctgatcacgg aagacagtta 
202261 gaccttcctt tagactgagt tggaaccagt cacggcatgt caccctatca tcacaaatag 
202321 agtttcagtt ataaagacaa gcatataagc aaaactggaa gatccaacca acaagaatca 
202381 aatcatttgt aaattttaat aactaataag gtaagataaa tgaggctgac ctgttgcctg 
202441 tccaattgtg gaagtactgc acagcaaaaa tgaaagacac ggcataagat ttcaatagaa 
202501 aaaaacatgc acaacataaa atttcaataa acagcataat gtacgattac atacttcatg 
202561 accaataact cccaaaattg cagcataatc tgcatctgtt gcagtttctg gagatgccag 
202621 gacaagcttg gaattaaaaa tctgcaaggt agagaagctc atatatttcc atcaagacaa 
202681 aatatcagca caagtacgtt attgttataa agtaaagtag aattacgttc aaactcttgt 
202741 tttccatggc tcccctgaaa atggaaaata aatcaattag tttgattatt ctcgacagta 
202801 agaaaagtaa aaagacaggg aagcatgagc tctttatttt ggatagaatt cactatttgt 
202861 tcatacatgt taaaatctgg aacggcgaca atgttgaaga gatccaggtc atactcaagg 
202921 ccaaacacct agaggaaccg gttaagacac aaaaaaacat cagattcgat acctaacaaa 
202981 tacatgcaat actaaaaaaa cttttcaaag tacttacatc ttcatcccac ttcatggccg 
203041 ccttcagaga atacatggca tgagcagtct ttggtagatc ttctgcagga gtccagattt 
203101 tcagagatac ctccctacca gagcgtgtag taaatgtatc atctctgctc gctagctgtc 
203161 cagccaccag agcaaataga tagcatggct tcttgaaagg atcctcccat aaggcaaagt 
203221 gccgacctcc ctacaaatta acgaagagga tatgcgttca gtaatgacaa agtgagagga 
203281 catttattcc agaaaagggt cactattttg ctaacctcta catctccttg agaaatgagg 
203341 tttccgttgg acaacagtac aggatagagt gacttgtcag cttcaacacg gcatgtgtac 
203401 ttcgccataa tatcaggacg gtcctaatca ttgcatatta aaaacactat gtaagaacat 
203461 caattgtact ttctcaacaa gagtcatctt atatagaagt ggcaaccaac ctggtaaaat 
203521 gtgattttcc ggaaaccctc tgcttcacat tgtgtgcaaa aattcccaga tgacttgtag 
203581 agcccctaac agaataaact taaaaaggtt aaaactgttc tggtagatca ttaaagatag 
203641 caggaaaaga aaaaagttga aggaacagat gcagaaaagt ttacatcaag tgaagtattc 
203701 ttgtgggggt atatctcagt atcaatttcc agaacaaagg actcctcggc cggcagtgaa 
203761 ggcagagtga gatgacgaga atccaactgg taatcccctt cctaatcaaa gtgaaagaga 
203821 gaccaagtaa ctaaaccaaa ctgatttaca tatattctac cctacggagt tgattttact 
203881 aatataaaaa tgaaaattag tactacatag aataaaagag tatgtatgct taaattgata 
203941 cctttagaag cttcccctca accttgacag agagtagctt caagtcatgc ccattcaaga 
204001 ccaagggagc agaagatcct gggcaacaga atacttgttc agaatgtgat gtgaaaaaaa 
204061 atggaagaaa caaagatgaa tgttctgctt gagtaaagtg ttaatgtcaa aaactagtaa 
204121 tgtactagac tgttgtacct ttaactcgag gggaaacttt gatcttggag ctaacaattg 
204181 tcttctcttc acctagagag aagcttagat ccacctaaac aaacaaacaa acaaaaagag 
204241 gttcagatta aaacaatggt gattctacac ataggctcaa actgaatttt aaagaaggga 
204301 agcatacagt ttcaaagtag taatcaggct tggtgtagtc cttgagaaag atttccttag 
204361 gtgcatccat tttggattct tcaactgatt cggtggcaac agaacaaatc agcctcctgc 
204421 tgctgtctcg cttatatctc tgacatatac aacaaaaaag atggatttaa ttttgcttca 
204481 taaaaatcca ^cattggggt tatatttatc atagaggaag gaaagggaga gttacatcga 
204541 cagaatatga tagaaactga ttcttgctca aacaagtaac ctgagacaga gaaacaaaac 
204601 tgaagaccaa tctcaacttt aagtaacagt tcagttggtt gagaaaacga aagaaacaca 
204661 aaacctggga agtaagaaag ggtctgtatt gagtaagtct ctttgctgaa ttacgtaacg 
204721 gaagtgaacc gcttcttaag ggaccctaca aacaagagaa aactgaaaga taaattaaac 
204781 tacaaaagcg aaattaaggg aaatgaatac aaaagcaaag ctaaccctaa acaacatatg 
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204841 catattatac gagtgatcaa aaaagtcaaa gctagtggcc atatacaaac tcaatcattt 
204901 cgatttagaa tctttaacca aaggaaccaa aaaacattaa tccaaaacgc aattgtaaaa 
204961 ctttaacaaa cattaatcga tttggacctt taaccgaagg aacaacaaaa gaaatcaaaa 
205021 caagagagac agagagagag tttagagagt aatcacagga ggagcacgag agatcaaacc 
205081 gagaaggttg aacctagtgg ccaaagacga cgagcttcga caaggaagta tcaatcgagc 
205141 cataacaccc aaaaatcaag ctattaaaac cctagaaaac actctccact acagagaatc 
205201 aatcaaaaca cacagtaaga gaaaaccaga gcgacaaggc tagggctgcg gactttataa 
205261 gggatgtgtt aatcggatag catttaaatg aaaaagaaaa ttatggccac gaaaattcaa 
2 05321 tattaatatt aacaaacaaa caaacaaaaa attgatcttg cgattcgagt gatgatcaaa 
205381 atcgatagga ggaagaagac gagattaata cccgagatct gcgattatca gatgaaggag 
205441 gcacaactca tataatcaca agaagaaaaa tgctttgctt tgcgtgctct gctgatatat 
205501 ttatataacc aaaaaaaaaa gatatttttc gctaacaaaa gaaaaaggat aaaaaggaat 
205561 aacagaaatt atttgtattt tattattgtg tcttatccaa tttcctatcg ctgcacttta 
205621 gcacctcgag gttactctat tccaaaaaaa atcagtgttt ctgaaaaaaa aaaatatttc 
205681 caaaacatat aactgtgact aataaaattt aattggttaa caaatataga aaataactaa 
205741 tgacaaaaaa ataatgtata agactattag taatcaaaat tttatatttt ttttggatat 
205801 agtaagaact cgaaaacata cattttttga aatagagagt aatttatttt tggtcaacaa 
205861 ctcaactgaa gaagagaggt gtattttact atttcaaccg gataaagaag aggttacaac 
205921 aaatcgatgc taattctagt taaccaataa ccaaaatcgc aattttctct aacatatcat 
205981 ggtaaacaga gtttgacaaa gacttgttga agtatagttg gcttcataag cggtgcattt 
206041 aagaacaact tatttgaaaa attaaaattt catcaataac attattgatc tgtttcagca 
206101 tattatatcg gcaacaatat gcaattgcct caaccacata gaaatggcct agaaccatag 
206161 aaaaaaaaat ctcaacaaat agcttttttc ttgtataaaa acaacccctt atagcctaat 
206221 tagctaatgc cttcacagct ttctctgctg cgtcatccaa atcatcagct gttatgagtt 
206281 tcattccact ttccttgagg attctctttc cttgttcaac atttgttcct tccagacgaa 
206341 ccaccactgg tactttcagt gacacctatc atgtggggtt tggtttcgca caaaactcta 
206401 tcaattaact atttatgttt atatatgatg aaatatgtat gcttcgtttt gctcacctct 
206461 ttagcagcat tcacaattcc actagcaatc acatcacatt tcattatccc accgaatatg 
206521 ttcaccaata ttgctttcac tttatcgtcc gatgtcagta tcttaaacgc ctccaccacc 
206581 tattattatc acagcctttt ggttaaactg gtttaaacct aacaagcttg atagccagag 
206641 atactacgaa gaacatgttt tccttacctg gtgttcagaa gcgtttccac caacgtcaag 
206701 gaaatttgcg ggagtcccac cgtgcagttt aatgatgtcc atggttgcat ggccaatcca 
206761 gcaccattca ccatgcagcc aatctctcca tctaaaccga tatagttcag gtccactttt 
206821 gcagcagcca cctacgagat gaaacgacac actttccttc atcaagcttg ataataaaaa 
206881 ctaaaagtgt aacataattt tttattgtaa cctttctgat ttatttaacc ttaactggta 
206941 tatacacatt atcttgtgta caggaagagt attactaact ggcaaactaa tcaacaagct 
207001 gactaacaac gagaaaacca ataacaacaa aaactattta ccctatccag attcatgata 
207061 ccattttcta gagttgaaca tccataagct ttgatataac ataggtttct tggtgaagaa 
207121 catacgatac gatgcaatgg accatacctc tcgtggatcc tcctgcgttg gatcacgaag 
207181 ggcaaaaatc tctttctgac ggaaagcagc attatcatca aagttcaact tagcatcagc 
207241 agctaccaat tggttcgtgg atgtctcagc gagagggttg atctaacaga gaaaaagtaa 
207301 gggttcaaca aaagatattg aacttgtgta taataaacaa gacacagaga gaatgctcac 
207361 ttccaacata gtgcagtcac tcttgcggaa gagttcgtaa agcttcttca cttgttcaat 
207421 cgaatctttt ctgtcagcag cttttggagc cagaccatcc acaaccttgg cagcatcctc 
207481 atctgtaata cctgcaaata catcgatcgg tacctacaga aaaccatgga atgagggtca 
207541 aaattggctc aaccagagta ggaatacttg tggtgatgaa acacaaatca aaccttaata 
207601 atcatgtcag ggaacttctc agcaagatct tcaatgctgg taccaccctt tttacaggca 
207661 attataagct gaaagaaggt attaaagaga tgaacgacaa gatcattcct gagcagtaag 
207721 agaaatggat tttgcatatt tattattttc aggcagcagc ataccggtcc agcagattta 
207781 cggtcgagaa taatggaaaa gtacatctca ttgacgagtg acaacttctc acacaagtag 
207841 acctgctcaa aaaaaaataa aaaaaaaata taggaaacaa gaccgctgag caaaacttac 
207901 aaaacgctaa aaatcagtat gatcttcatc aaacacacaa accttgctga ctactttgcc 
207961 ttgaggacca gtttgtttgg tgacgagaac ttgcccaagc atcttgccta caaaaagata 
208021 ataataataa caataacaac aacctgaatc ggatcaagaa actctgatga gagaaaaaag 
208081 aaagtgcgtg tgcaaaacaa gtaccagcaa tatcttgaac ctgatcacgt ttgacaatgt 
208141 gaacaccacc ttgaagacca ctcttgaaag tcccgagacc tcttccacca gccaagatct 
208201 ggctcttaac gaccaactgt atattaacaa cacgaccctc ccatgagaca aaacaaacta 
208261 tttgttggca ataatgaaag ctcttataca gtacctcggt ttcattaggg aaaacttgtt 
208321 ggatagcgtt tttgacttca tcgagggaag agacagcgac tccgttgggc acgtttactc 
208381 cgtatttacc catcagctct gctccctgtc agagagataa ttgatattca aaaataaaag 
208441 aattacagat tcgatgaaga cagccttgtc gttcgtcaga tcgtgtccta atttcgcatt 
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208501 cggaaacagt tgggaaagcg aatctaggta gagagaaaaa aaggaaggag aagaacctga 
208561 tactcgtgga tgttgagacg gcgaagctgt tggtgctgcc atttgccgga gacggagaga 
208621 gatctggaca cgagcttctt caccaatccc ctcatccttt tgatttttgt cttacggtaa 
208681 aaaaaaatga aaatcgatca gagaagtata gagagagaga gaggaagaga atagtcgcag 
208741 cgagtctagg gtttttaatt ttctcaaatg atcgatcttt ctcccttaag aaatgagagt 
208801 atttcgtttg tttttctaag aaaaaaaaaa cattccataa aaatacaata caaatataac 
208861 ttctgtcttt tcttttccac aaaaatcaca aataaataag tactattggc caattggccc 
208921 aatcccggac ttttctactt atttttgact aattttattc acaaaaggta cataaaatta 
208981 actgaagaaa ctagggatag atgttcggtt tcggactttc aaatattttg atatatgaat 
209041 atagagcttc tttgagtatc tttaaacttt ggatcggata ctgatattac agaccggatt 
209101 tgattattta aagttattta gaaaataaga gcaagataaa catagttctg ttatgtacca 
209161 gattgtgatc gacataaccg gaccgaccag gactgtaccc accataggag ataatgctta 
209221 tcgactgttg tacttatcca cttaggataa acacgacctg atgtatctat atatgtaaga 
209281 ctcttggttg agattaataa caagaaacac gtttccccac ttagttttat aacacgttat 
209341 cagcacgaga ctctgaaatc ccgagctacc tcaaaaaccc taattgacgg caccacttca 
209401 aacagttctt tctctccaac cgtaaggatc caaacgatga tccacatatc agattgaagc 
209461 tcttgacgag acgaatccaa cgccgtcggc cacgcctcta tctgacttcg gacgcgccct 
209521 cacgctctat tataaaacgc gacgtcaagt gatctaaaac cctgaaattc taaaactcat 
209581 ccgacgactt caaacagttt gttctctcaa accgtaagga tccagacgac gcgtaatata 
209641 tcaaatcgca gctctcgacg aaacgaatcc aacgccgtaa acctcgtgtc aatccgatct 
209701 cggacgctcc ctcacactct gtttcaatac gcgccgccag taaccctaaa aactttaatt 
209761 ttggcgcaac ttcagaaaac gagtaatata tcaaattgaa gctcttgacg agacgaacag 
209821 atcaccataa accttgtgtc gatccgatct tagacgcgct ttcacgctct atttcaataa 
209881 gcgccgccag aaaccctaaa aaccctaaaa ctctaatctc ctcttttgtt ttacgatttg 
209941 tgtttgatca ttattttaac ttgtttttga ctcttatgtt taaggtctaa ccaagatcaa 
210001 acccttggtt catagattcc attcaaccca attggaggtt aagaagaggc gaccaatttt 
210061 ttatccgttc cggtcatgca gttcgcgatc cggctcgttc ctgctctcag tggtccggtt 
210121 ctacaaacat caaagtttag gtaattaaaa ttctgaaatc taaaagagaa cccatactga 
210181 atttggttgt attgataaag gtttaagact aataaaattt gcaaaacccc aaaccctaag 
210241 ataatagatc ccaaaatcct ttgagtaagt tggagctttc aagtccaaca gagattgttt 
210301 ctaacaatta aactcaaaac cctaatggtt ttgaatctca cagcttgatc tattgatcta 
210361 taaagtaact aaaaccctaa atatgaccta ttatgtatta tggtctaaaa attaattaaa 
210421 attcatgata atctcctaaa aatccccaaa gttagttttc catcatcaga atccgaccca 
210481 ttggtctgga actatgtgaa aaacgatttt tcggtttctg gtcagaaatt gaccccttca 
210541 aataaattcc gaaaaattaa ttaaaattca ccaaaaacca tgataaatcc taactaagaa 
210601 accatataag tagtttcttc agattcgttt ttaatcacat agtcatagtg taaatttttc 
210661 ataccgaaaa agtgcataaa aagtgtgtga ttgtttgaat tatttgaata acctagaaac 
210721 gattgttagg atctcattct gattttggtt gaatcattgg agattgcttg aaaaatttcg 
210781 gatttttcca gattgcaaag ttgcaaattt tacttttttc aaaactttcc ggttttgtga 
210841 aaattaactt tttatggttt ctgaaaactt ccatattagt tttagaataa tggaagattg 
210901 gtaaaactag tttaaaacca taatagaacg ttttccagat tgatagaatg caaaattaac 
210961 ttttctaaaa acttctattt ttgagaaatt gtcttttata agttgctgga aacttttcat 
211021 ttttcttttg gaagattaga aaaatgctat aattgttaag attgatctga ttttttttta 
211081 agtcatataa tggtttcgaa aataattatt aaaaaatgaa accatatatt ttcgaaaccc 
211141 taaacccttt tttctctctc tttagaatcc gataatattt taaatcatct agaatcaatt 
211201 atttgatctg atttatattt ttttccatct tgatcatgta gacaatgatt gctaaggttg 
211261 catcataaaa catttttttt aattgattga tcatatgaga aaatcagatt gctagattga 
211321 aagaaaaatt ggattgcatt gcgtaaattc agaggttgaa agaaactaaa atctgcattg 
211381 ctatattgac taaaaaaaaa ttggattgaa ttataaataa atggttgaaa gaaaccaaat 
211441 cacattgcat aaataaaagg ttgaaagaaa ccaaaatgca ttgtttgaat ccgattataa 
211501 gtcaaataat aagactctaa aatctatatt ttcagatgtc gaatttggat tatcaagccc 
211561 ttaatctctc tggagataat tatttaaaat gggctatgaa cactgcaatt gtcctgaaga 
211621 taagaggact tgacagatgt atcatcaaag gcgagtatgc agctgaaaat gaaaaatatg 
211681 gggcaataac aattattcgc caacatctca ctgaggatct cagagatcag tatctaaata 
211741 ttgcgaaccc tctagacctt cggacagagt taaaatccag atacacaata gtgtcattac 
211801 caaaatttat aactcaaatg ggtcttttat ttgagtggat aaatctcaga tttcgggact 
211861 ttaggtccgt agatgaatat aactcagctc taatcaaaat cgtttctaaa ttgaaactat 
211921 gtggtgaaga ggtaacagag gaagatttac tggaaaagtc attcctcgca gctgatccaa 
211981 gggatctatt gttacaatat acctacagaa aaaatgtttc accacttata cgaatttgat 
212041 ctcgtatcta ttacaagctg agaagaataa tgagatacta aagaaaacca gtgagatgag 
212101 actttctgaa gccaataagg ctggagagaa taagggtgaa tccaaagaag ccacgtccag 
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aataataaaa 
ttttatgttt 
tttatattat 
tttttttaag 
ggggcatgaa 
atggtttcca 
agtaaaaatc 
tgctaaaaga 
aattttatac 
agattgatat 
tgtacacaag 
gtacatgagg 
aataggtcat 
catatttaca 
acagattata 
aaggtctgga 
atgataagag 
ctattcatgg 
tcagtccata 
caacctaaag 
tggacagaat 
acaaggtcta 
aaagaagtca 
aaatgaaagt 
ggacctgcaa 
atccgagtac 
cagcatcaaa 
tacaaagtat 
tgatgcattc 
ttttgccaca 
gcaacgaacc 
gacataaccg 
tacttatcca 
agattaataa 
caaaaaaaaa 
aaataaacaa 
aaaaagcaat 
gtatgattag 
agaattagtt 
ccttgatcat 
atcaagggaa 
aatgctgaag 
agagtaaaaa 
atctgtagtg 
ccagtatatt 
tacagcccaa 
gaacgaaact 
acattatgaa 
actcaaaaga 
actgtattgg 
aacaagtcat 
gtcccatatc 
gaggtcacga 
tttctaagga 
gattaacggt 
gtcttgatct 
agagtatggt 
cctcacctac 
ggttctcgct 
atttctggca 
ctctctttac 



ggagtggtcg 
tgatttgttt 
catgtttgac 
aaaaaagagt 
ttattcaccc 
cattgaacca 
gcccaaggca 
ttagtatgat 
actttatggc 
cgaaaaaagg 
ggaaactcaa 
gggagaaatg 
gtgtgggaaa 
gcattcaaag 
aggagatgta 
tataagaaaa 
tatcagaaag 
attgaaacaa 
gatccttata 
agaggttcaa 
atgatgggag 
tgacttaaca 
tgagacatca 
ttccacggta 
gaaagattta 
tgattgagca 
ggagcacgtc 
tacaagaagc 
atcaggggga 
ttggttttag 
agtactggat 
gaccgaccag 
cttaggataa 
caagaaacac 
agtgaatcat 
aaacaaaata 
agttcactga 
atttaatgtg 
taatttatca 
tatcaagacc 
aacacaacgg 
agaaagatat 
gatcaaagtt 
acctcaatta 
cttggcttga 
gagctttctt 
atgaacttct 
gtgccgtcac 
tacacagaga 
ttgtattggt 
tgtggggaga 
gttaatagac 
ccttacttga 
gacaggccct 
tgtgatactc 
ctccagagtg 
tggcattgat 
ttgagttata 
gtaagaaaaa 
tttttgccat 
ctcaggggca 



gcttatacat 
gtcatttata 
ttgcttgata 
atagtaatta 
aaataaaaga 
atgggcaaaa 
taaaaatggt 
aaaggcaaaa 
atgactggac 
cacaaaagag 
taggctataa 
acactcccgt 
gaaaaagaaa 
aatggcttga 
ctcctatgtg 
gagaaatgta 
atgagaaaag 
ggcagttgca 
gaggatatta 
tggttcaatg 
tagaaacctt 
tgtatgtttt 
tccagagaaa 
tggcaaagcc 
atcgcaggat 
tcatccatcc 
ttgaccatgt 
tcatcgacca 
gttcatgtgt 
ggttttccag 
gtgtcgatca 
gaccgtaccg 
acacgacctg 
gtttccccac 
ataataattt 
aatagttaat 
agtaatctta 
aactattgaa 
atatttataa 
gtctaccaga 
atcgcaagtg 
gttatctcat 
gacaagtatt 
acaacaaagg 
caagaaccaa 
catcaaacgt 
tgggtaagag 
aaacaaatct 
agaaaccgag 
attttttttt 
aaaacgatgg 
agggaagtca 
acaggatctg 
caaacctggt 
ctcggaggct 
gtcgcacggt 
gaggatgtta 
ttgcttttgt 
tgattaaagc 
tgtgatagca 
gcctttgaaa 



tgcctaagaa 
ttttttttat 
atttcttgaa 
aaattatccg 
cccatttgag 
gaaatataaa 
cgagactgta 
ggcctagata 
tagccatcca 
ttatcccata 
tgttccaagc 
ggtccatgaa 
tcgtgagaca 
ttatacaagg 
gtggatgcta 
gtaagcagca 
aaattctcat 
aatgatagac 
taattccttg 
atttcaatga 
agccatatat 
cgaaaacata 
gtgaccagaa 
atgagactag 
agaggtacat 
gaccagaaca 
ccaaatgaag 
gataggaatg 
tgtacttttt 
gagaggtttt 
aagaggagtg 
accgtaggag 
atgtatatat 
ttagttttat 
gtaagttgta 
caaagagcag 
gaaaaagaat 
aagtgtgatt 
attcaaacct 
cattatatgg 
tttgggtgca 
tgttgagtct 
cttagcttgg 
gttcattcac 
agtattggtc 
acgcagaagt 
cttcggcact 
accagaaact 
aaacctctgt 
tttgagggga 
cgtattaagt 
agctatttat 
ttctataggc 
tgatgaatca 
ccgcactgta 
tgtctgacag 
taactttcaa 
tagcagagtt 
tttcgttgat 
gacttttgaa 
taggcatatg 



tcgagatttc 
ataataagaa 
tgataaatga 
accaaaggca 
ttataaattt 
ttccttaaga 
ctcccaactg 
accagatagg 
agtatttaaa 
gaatctcacg 
atctaaaata 
caacactaaa 
tggactaagg 
atactcacaa 
ctacaaattc 
tgattgatca 
agaatagttt 
taagaaaata 
tgtttatgtt 
tacaagttat 
gagtatattg 
gcattgtcta 
gaatgtctga 
aggaaccgtg 
ccaagtactg 
tgaagacatt 
ttccagaaag 
catcgtccaa 
ttcctgtcca 
aatgaggcaa 
ttatgtacca 
ataatgctta 
atatgtaaga 
aacaagttca 
agatgaaacg 
aacatcaagt 
atccctccgt 
agatttaatg 
ctaattcata 
taagactgtt 
aagtacttga 
atcatgtcag 
caatgacgat 
aagaacggtt 
tgcatcttga 
ggaatcactc 
gaggagacag 
tgtaatgaaa 
ttcagtctct 
gaaaacgaca 
gtgagatgtt 
aaagtgcaga 
tcgtacctct 
aggccgtgcg 
gaggatcgct 
gctttctttt 
ggcataaacc 
tcactctgtc 
tctacatttc 
ctttcttgct 
atcatgtggg 



gacattctga 
ttgttttagt 
caaatgattt 
ttgcctaaag 
ttgaaaatga 
ttataaagaa 
atctaaacta 
ttgaccacga 
attgatgcaa 
ttgtgtaaca 
tttatagcat 
accagtgaaa 
tgttcatatt 
agaccaagga 
gaaaatgata 
ctggataaag 
tgttcctaag 
gttagtacaa 
tataccaaac 
cgattgattt 
gcgtgtgatg 
cgacaaaatg 
gtcaataata 
ggacatgaga 
gtcgagtact 
gtcgagtggt 
ccggcgaaat 
agate tacag 
tggttttcca 
cattaagcat 
gat tgt gate 
tcgactgttg 
ctcttggtcg 
taaatcaaaa 
atatagtatc 
aatcaaagat 
aaattgaaaa 
tgaactataa 
tatacatgat 
tggtactgat 
tcttggcgtg 
taaaacaaac 
acaaaaacaa 
tctagatggt 
gaaagtatcc 
gattgagagc 
agcgtgagaa 
cgaaattacg 
tcaccagttt 
gcatattgga 
gtttatttac 
gtcgaagcta 
gtatccttga 
atctgagtgt 
ateeggttag 
ttcgtttatg 
tcaacagtcg 
atattctttg 
tcatattgtt 
ggtccgggta 
aagaaactca 
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215821 gactttaggg aacaaatatt ggaccgggaa gaaaaaaggg atagaagatc cataaagttg 
215881 aaaaatatgg gcaagtggac atgtactgat tagcaatatt attaccaaac cttgaagcat 
215941 aaaaatttgc tgatctctga gttaatttgt gaagttgaaa catattataa tgtaacatgc 
216001 tcaaaactat cccacaattg cttctttatc atgagtcata tctgtatttc tgcctgcagc 
216061 catgtttatt agaatgatgc agatataagt catttgggct ctgtgtcttc taaagctcag 
216121 actcattcag acaagagtga tctagattgt tactttctgc ctgccgtttt gattctcaaa 
216181 cttataattt tattttctta gtagaatgat gagcagaaga acttaggtga actctgggat 
216241 gttatgatac aaaacacaag ttgggcagac ccttttctga gatattgaga cacataactg 
216301 aaaaaataat taaaattata caaatagact cagttttcta ccttcttaga ttgtaatgaa 
216361 ctgtttggat ggaaccttga aaaaggccta tcaaaactca caagtcacaa gcagccttga 
216421 atggtggtca caggatcatg tggaaaaaga ctggaaggat gggattggta ctgattagtt 
216481 actaaccaaa ccttaaacca taaaacaaca tattgatctt ttttttttaa gtteatttga 
216541 aacataatct ctaatgtaca aagttcaaaa tgattccaaa atgctgttgt agttatgaaa 
216601 tcagtatatc ttcccctgag ccaccaggtt tactcaaatg gccggacccc gcagctcaag 
216661 cccatcagtg attcacgcgc ggtttccact tcccatagtt gagtgtgtta tgttagcaaa 
216721 acaaacagag ttaaaagatc aacattcacg tcttatttta gtctcttcat ctgctttact 
216781 gtatttgcca tttggtattg gtattttttc ttttgagggg agaaaagggc tgcgtattga 
216841 aataggtcat ttggggggac caaacgatgg cgtattaagt gtgagatgtt gtttattacg 
216901 tcccatatcg ttaatagaca gagaagtcaa gctatctata aagtgcagag tcgaaggtgg 
216961 aggtcacgac cttacttgaa caggatctgt tctataggct cgtacctctg tatccttgat 
217021 ttctaaggag acaggccctc taacctggtt gatgaatcat ggccgtgcga tttgagtgtg 
217081 attaacggtt gcgatactcc tcggaggctc cgcactgtag aggatcgcta tccggttagg 
217141 tcttgatctc tccgtagtgg tcgcacggtt gtctgacagg ccttttttta gctttgattt 
217201 ctttttagtg cactgaggtt agaaaccgca atgtataaag cagtgttgtt tttttctccc 
217261 cggtaatgga cagagcagtc gtgttgttta tagaacgaag ccaagaaggt cgaggtcacg 
217321 acattacttg aaccggatct gttctataga ctcgttattt ttaagctttg gttgatgctt 
217381 actacactct tttaagtgtt tgtacatata aggcacttga gctctgtgtc gtgtcttctg 
217441 aagctgaaac tcattcagac aagagtgatc tacagagata tttctgcctg tcgtttgact 
217501 ctcgacttat aaatttgttt cataatagaa ttatgagaag aacttatgtg aactctggga 
217561 tgttatgatt caaaacacaa gttgggcaga cccttttctg agatatttta agaaacacat 
217621 atcactgaaa actagttgag attataaaaa cagtctcagt ttgctaccgt agattgtaaa 
217681 tgaactgttt ggatagaagc ttgagaaagt cctatcaaaa actcactagc agctgtgaaa 
217741 tggacacatg atcatgttga aaaagactgg aggtattacc gtattggatt tgtactgatt 
217801 tagttactac caaaccataa cgattcaata taccctaaga gaattgccac atttactttt 
217861 aaactcatca ctttttttta ttcatttgaa cgttgtatat tgtattatcc tttttagtta 
217921 gcacgtcggt gctaattcta atcatccaag aatcaaaatc gcaatttttc tcaaaacaac 
217981 gaaattaaag gtgaaacaaa aactttgtaa agtgtttgtt tagaagccgt gcatgtgagt 
218041 atcatcaatc atcattattg gccgataaag cggcgttttt tagcagagtt catgtagggt 
218101 ctaagaaccc attccatctc tgcttcctct tgttccattg ttcctaattt cacctgcatc 
218161 accaagaaac atcatttgat atttaaggaa cctgattgat atatattcat tatacaattt 
218221 aacaaaccaa atcgaatcaa gacagaataa accgaaacaa tgcactttta cctggtagag 
218281 ccgcaactca aaccgaggac cgacttcttt taactctatc gattttgggc ctccttctcc 
218341 tttatcatac acatgattcc tatttttata tatatatata tgcccagaat cattttctat 
218401 caagaatcca cacaaacaaa caaacgaaaa catgcaatat cttaggacaa aaaaacttgt 
218461 acctgaaaga aatataatca gatttattac ggaaagtaac tatacgtctt gcatccagtt 
218521 ttggaactgg gaacatgtgt tttaacatgc tcccaactct tttacccatc tgccaaagaa 
218581 aaaatacagt aaacaaactc tacaaaactg gtttacattg gagcaaagaa ggaagtgaat 
218641 ctgttctaac agaatatacc tggcttgtaa agttgtcaaa aatgacgtaa ggatattgct 
218701 cagacatctt ccccgttgct ttcttgcttt ggatatcata ccttgttacc taaaacaaag 
218761 gctggagttg ttaaaaaaag gaaagatttg aaaagacaaa aatcattagg atgcagaggt 
218821 acttaccaca ttaagtaatt gaaagtaagc agttggtccg aatgggagat gagaaacgat 
218881 aaggccatca ggcctaccac ggttctcagt aacaaatatc acatcactat aatcatgcga 
218941 acgcgccatc tcaataatct cagaaatgac ctgcacaaag aaacaattca gtatataaat 
219001 gggtattgtg ttgtataaaa gtcactacaa gtccagaata ctcaaactcc taatagactt 
219061 actcatctac acgttcaaat ctttgaacct atgttacgcg taactatctg atgtaagtaa 
219121 aggcatgaat gcgatagtta cagccagata acaaaatggt ccaaaaggga gctaaatacc 
219181 attagagctg aacttaacga caaatatgag agaaaccaca aggagtgacc aattggaaaa 
219241 caaaactaga ttatataaga tgatgattag atttagtaca gaaaggcaga agatgacaaa 
219301 tatatacctg actaccacga ttcattctct tggcgttagg aaatacattc ttcaactcct 
219361 ggcaattaaa ggttataaat caatcaccaa gccccgaaac gttgtaaaaa gtaacacgga 
219421 agaaaacaag ccacacaata attaatggta aacatctatt caacgagagt actgaaacta 



149 



WO 03/006622 



PCT/US02/22217 



219481 

219541 

219601 

219661 

219721 

219781 

219841 

219901 

219961 

220021 

220081 

220141 

220201 

220261 

220321 

220381 

220441 

220501 

220561 

220621 

220681 

220741 

220801 

220861 

220921 

220981 

221041 

221101 

221161 

221221 

221281 

221341 

221401 

221461 

221521 

221581 

221641 

221701 

221761 

221821 

221881 

221941 

222001 

222061 

222121 

222181 

222241 

222301 

222361 

222421 

222481 

222541 

222601 

222661 

222721 

222781 

222841 

222901 

222961 

223021 

223081 



aattcaatat 

ccctagacgt 

caatatgact 

cgacaaacca 

actacacaaa 

agttcgaaca 

tattaacaat 

aaaaaaacat 

agcttcgcct 

acccaaatca 

atcagtgtgg 

cttcaccttc 

acatcttgat 

ccagtttggt 

acagtcttat 

caagctactt 

aaatttttga 

gtgtacagat 

ttacacctaa 

acaagtgtgt 

gtaaatacgg 

cccaacaaaa 

tccggtaaaa 

aacaattttt 

cctacatgat 

tatatatata 

tctcatttca 

gatgttagat 

aatatttatt 

tatatataca 

gattcataat 

tcaagttaga 

aagttttgaa 

gtgatgcata 

tttggtatta 

aattgaaata 

tcattttttt 

acaataacag 

ggaattttta 

cgtctcttat 

ttagtttttt 

ggtctgccct 

atgtaactta 

tcactctcac 

tatagcacaa 

gtaagttgtg 

cactaatgta 

agagagggag 

ctcgggagac 

gtagggcaga 

gcgaggaaga 

aaagagtttt 

ggacgatcaa 

cccggcgctc 

aatgcagctc 

gtggaagtga 

gtgactacta 

ctgttgatat 

caagatccca 

aagtaattaa 

gcctatttac 



atatatatat 

agtcatcaaa 

ccctggaact 

caataatgac 

gtatgaacgt 

gaatatattg 

aagctccaat 

gaggattctt 

cctcgtttcg 

caacaaaaaa 

accttgcagg 

caagtttttc 

tcgtcgtgag 

aggaaagggt 

aataataaaa 

ttaggccact 

cagtgacgca 

gtgtccttgt 

ttaatagtta 

taggtatgtg 

tgatctacaa 

tctaataaaa 

accacaaaat 

atagtttatt 

atattcatat 

tatatatata 

attagtaatt 

aggagaattg 

tggactcatt 

acttggtagc 

gcaatgtgaa 

agatcgatta 

gaatgcatca 

tttttatgta 

gtgaaattgt 

tgaaattggg 

gttgttttag 

taactttttt 

gactaatata 

atagattctt 

cttatttaaa 

tcaccatctg 

cagatggctt 

ttgaataata 

agaatctgaa 

agttgaatct 

gcaacaaaaa 

agatcttatc 

gtggcagcat 

agactagata 

tggacaagtt 

gggaatacga 

gggcacagca 

ctgaacagtt 

ttacggtgcc 

atggtgtgaa 

cttgttcttg 

cgaaggtatc 

acacaagatt 

tctttttcct 

tgttgtagat 



atatatacct 

atcttgggat 

actttcacaa 

actgacaaca 

atactaatta 

aactatgtat 

tatataacac 

accggaggtg 

gagctcagat 

ttatagattg 

gcttccctta 

atgtagagat 

tgacagagag 

ttcaaaagat 

aacggcaagg 

tagtatgtgg 

gggtttaact 

gcttgtatat 

tcttagtcac 

ccggtgtttt 

ctaattcagt 

ccaagaaact 

ttaaaactat 

cagatcgact 

atttttactg 

tatatatata 

gtcatgtcta 

gtgttagatc 

tggaaagaaa 

atattattat 

aattagagtt 

taaaatataa 

ctaaattaac 

gtaattttta 

tttttctttt 

catggttgtt 

atatagaaac 

tcttcttata 

aaaaatgtct 

agttttttta 

attaagagac 

atttttacct 

ccagattcct 

ataattggat 

gcgaaaaaac 

tctccttgtc 

aaaaaacaga 

catggcggct 

ccgatgtggg 

caacgatggc 

tSgtggctcg 

ctacgagagc 

gcaagaggtt 

tagaaagctc 

tttctttcac 

acagagaagt 

tcatgtttca 

tagagaacag 

tcttcaccaa 

caccatctaa 

tttgaagata 



taacgaatcg 
ctttttccgt 
catgattaga 
agagtttcaa 
cgcaaagaat 
tgaaacaaca 
aagtagaatt 
ttttgatctt 
ggaatcggct 
tttaagagag 
tcttccgctt 
actcctttct 
agagagagag 
gcaggagggg 
tgtttgtcta 
gctctaaata 
attgagttgg 
attttctatt 
tctgttgtat 
tagacatgat 
ttatattcaa 
agtattaact 
tgcgagatca 
ttgaatagta 
atttaatata 
tatatatata 
cagacaactg 
agtgaagata 
cataagttat 
tatgttcaga 
gttgtacaaa 
attttctgat 
atcttaattt 
agtagttttg 
gcttggagaa 
tttcggttta 
ctttcgtaaa 
aggctatgtt 
cttaactttt 
tttaaagtta 
agtttttata 
tccagtattt 
ctcgagtttg 
ataacgaagt 
tgttttaaga 
tcaccctttt 
gcacgttggc 
attactagtc 
atctcggagc 
ttagctgaga 
aagaagaaga 
ttcgtcgagg 
gtgagagagg 
ttcccaccga 
tggctggttg 
ggggttcgta 
aaattgtcac 
tgggtgtgta 
tgagtttggt 
tcccttcttg 
tgagttgcga 



cgttagagga 

tgcttttgca 

caaagattat 

acgaatgagc 

gaacgtatac 

ataacttgga 

tagtacacaa 

caagatcaat 

tcccttctgt 

ttaaaagaaa 

cttctcatag 

taacctaatg 

agggtggggc 

aaggcaacta 

tgaaaaggtg 

ggccacaaaa 

tgacccaatc 

gttccatcta 

aactagtgtc 

tcagatttgc 

cgaaaatcct 

tgtgatccga 

actttgatat 

tctgatattc 

tatatatata 

tgtttaagtt 

gtttttgagt 

tgtcatacac 

tgttttctag 

gctatttata 

attacagttt 

tgaaaatctg 

ctatttaatt 

tcttgatttt 

aaatcatcta 

gtttttagtt 

tatgaaagac 

tattgcaggg 

aactaataaa 

aaagatctaa 

ttaagctaaa 

actgcggcta 

gagagattac 

gttctatagt 

ggcaaacaaa 

gcgtgtgtgt 

agggaagagc 

tccaagcaat 

cgagcggaga 

gagtgttcat 

agaaggatga 

tttcaaaggg 

ttcttctctc 

cgaaatgggc 

gtccttctca 

taaagaaatg 

agactttgct 

ggaatgtgtg 

ctcccactca 

gggttttgct 

gatgatatac 



gcacttggat 

tattcatcat 

ccacttttag 

gtatatacta 

taattacgca 

acctcgagct 

caggcttcac 

ctcctcacga 

tcaaacataa 

aaaataaaaa 

agccgacgct 

ttcctggcaa 

taccaaggaa 

caagattgcg 

aaaecattag 

caacactacg 

tctcaaatta 

gaccagattc 

ctttcgtcac 

ggttaaaccg 

tcaattaaaa 

cactggttaa 

tatcagtaat 

aaaatattgc 

tatatatata 

ttttggttga 

gtgtggtttg 

atgatggact 

ataatattga 

aaatcagtga 

tgcaagatgt 

ttacaaggta 

tttgtagttt 

gtaaactagt 

ttttatataa 

ttttttttgt 

atatgcgata 

tttttagggt 

aaataataac 

aaaacggttc 

aacattataa 

actaaatcaa 

caatggtaag 

ccacacatta 

gtggacactt 

tgtgtgtggc 

ccagacacga 

ccatctcaaa 

gccagctccg 

ggggctgttc 

gacgaaggag 

agtgatgcaa 

catgctccct 

tgcggagttc 

ggtcatagaa 

caggtacgct 

tttgctcata 

tgaatatgtg 

ccatgaaccc 

tacagtactt 

ggacaagtgc 
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223141 ctcctacatt tgaagaagat ccagccacaa aacaaccttg tttagcagac atatgtacgt 
223201 cttacaaagg cctcgtatct tagtatgtaa tctaacacat gaggatgttg cttgcctgca 
223261 ggttctatag ccaatccgag ctccccagtc tgtcctaaac tacaagcatg aacagcattg 
223321 gatcagctag atgtgctatt tccaaagata tacatttaca tactttttct caaacataca 
223381 cagaaaagtc acaatattgt agagatttga tactggttag tattattatg tgtttctatt 
223441 ctggttgagt tgtgaaagta tccgataaat acagacactt gaccttactc atctgcatcc 
223501 cctctctcaa tctttatcgt ttgtggagtc acatgtttca gcttattaga ggattcgttt 
223561 gaccgaggaa tcatataaaa gagattgaac atagtgcttt tgttagttac tttttttttc 
223621 ttgtaaaacc ttttgttagt tactagagag acaaattaag aaatgtggaa caaatgagta 
223681 caaacagaaa cagtgaatgt gatctgggag gtgatcatta aaagcacatg gaagttttgg 
223741 cccaacaaac aatcttgatg agactaccag aaacattaat ctgtgattgt taataattag 
223801 tactagcttt tgacccgtgc gtccgcacgg gtttttattt atgtttataa cttttaccaa 
223861 tgtcatagag ataattattt atgttttagt tatatattca tattcttgaa gaaattttgt 
223921 cccgttcgat gtgtttagca tccatttata taaactataa aattaggttt tttcattata 
223981 atgcttttgt ttattctaat tggtaaagta aaattaaagt gtaaatgtta aagttttaaa 
224041 tatatatttg ttggttttat gaaatagtag tattaatatt tggactaaag gttgaatatg 
224101 cataacaaat ttatattgca ttttaaaaaa aagtatataa gttaaacaga aacgttaaat 
224161 tatgaaaaat aacatacaaa agacggacat gactggtttt tccactacca ccctcaaacg 
224221 cagctctctt ggttggtagc ggttattggc gttttagaac aatcataaaa atgctacaat 
224281 cgttttatac tgttctgaac ctctgaaaaa caaaaattgg ttctagatag cgtttgcggt 
224341 tgcgggtaga taaaatataa cagttaaaaa tagaaatatt caaaaaaaaa aaattgaaac 
224401 aatataaata gtaaaatgta tatattacat tttaatttat attgtaaaac tttaaaagga 
224461 aactatttta tataatttaa aatattaata gaataatttt ataaatataa ttatatattt 
224521 attataatat tataatttta ataatttata aatatataaa acgtgaatat tgttaattta 
224581 ttatttaaca gctactgcat ttaatagtca acagttataa gtatcccgca aatacatcaa 
224641 tttccaaacg ctatactaat tgtacaaatt ttaaaaatgt ttaaaaccat aattacctgc 
224701 atccacaaac atccgcaacc gtaaatacaa ccggtgcact tataccagtt atgtcttcag 
224761 tatatttttt ttgcaaattt tattaagaaa gactaattta aattaaacaa ttggcccatt 
224821 ttgttattat ttaaaattca gtacttaata aatatacaat attgatcgtc taa'cagctac 
224881 aaacataaat tacaatacgt aatttataaa acaaatgctt tgtgattata acttgggcca 
224941 tcaataataa taagcccaaa actaatctgt ccaaaacata ttatttattc caaacaaaat 
225001 caccgaaata ttaggctatt aaacgattta gtcttgttta ggaaatacat taaatgatag 
225061 agtttaattt taatggcata tgattgtaat tatttaggaa aatgtagggt taattcctat 
225121 ttgtactcta gttttaatag tttagactag agtttgaccc gtacgtccgt gcgggtgttg 
225181 attttatttt attttcttat aaattgatta atgacctttt taaacatata agtttttctt 
225241 catttttagg ttcttacaca tccaaaccaa atccaccaga accttaaaac tccgacttga 
225301 gctccaccca aaaagttata atatccaaag agagtctaat ttcaaaaccc aaaaactcga 
225361 tatcttaaaa aacctatcat tacgtgaatg aatacatgaa tatatatgcc taaatgatta 
225421 tgtaaagaaa aaaaatgttt ttcattcaca tttggaatta ttatgattaa cacataaaat 
225481 aaatgacgtt gaattattat gataaaaaca attaatgaag tagttaaaag tgtgaaaaag 
225541 aatgtaaaga agataatata ataatttaaa tgggtaagtt ctattttgta attgtataat 
225601 taatactttc gaattatttt gtaaagagaa attaataaag taattaaaat aagtgaaaat 
225661 aaggaaataa aatatataat aagttgtaaa gttaattcta tttgtagtca tgtcatcaat 
225721 tttagtagtc ttcttacatt ttctttgtaa gggggactgt agaaaaaata tgtagcaaat 
225781 catttcgtaa ataatgtcta ggagatattt ataaaaacca ttttgtaatc taggtgtaaa 
225841 ctgaactaac ctgttaagag tcaaataagt gttcatattc ttcaatttta aaaatcaatt 
225901 aatgattaat tttaaaattt agtataatat tatattaatt gttcgagact aacatttcag 
225961 aaaaaagtct agaccattgc caccagaatt tgatttaata tattaaactg ggcagataca 
226021 ataataattt tgatggcttc gtgttttgat aaagtttaat tcaaatgatt attgtataag 
226081 atataaaact cttgaataat gtataaaata ttaaacctga acgtaatgtt tttggtaagt 
226141 gacctctccg gcgtaatctc aatccagtta ttatttaaat taatcattaa ttgtttgttt 
226201 taattacagt ggtatgaatt gtaattattg tggaaaaatt agggttattt cctatttgta 
226261 cttcactttt aataggatag atgttaataa tattatgaca aaatataggt tcataaatag 
226321 attaacaggg gtagtttggt gaaaaaagtt caagaattca tttttcaaat ttgcaaatta 
226381 tgtcattctt ttgtatatga agtgcaaatt ttcaatatac tgtagtatga tactccctct 
226441 gttttttaaa gatgtatgtt ttaggaattt tttttgtttc aaaaagatgt ttttatattt 
226501 tcaatgtaat ttttgtcaac taattatgaa taattgtgaa tctcaaaaac attaattgca 
226561 tttcttaaaa ttttcttggt ttagaaatat aggaaatata aaataacaaa aaactatgca 
226621 ctaataatta aattttaata tgttttatta aaaagtgtga aaatctcaaa acatgtatta 
226681 tttaaaaata gagggagtag tttgtaatat aatgtacaag cctaaaatat catatggccc 
226741 aaaaattcca aaaaaggaaa aataaaatct tctattttta ttttaagaat tgttctataa 
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226801 tctcagcatt ccagaagtga tttttagaaa ataaatattt gtccaaaaaa ttatatatat 
226861 ttcatctata ctatactaaa aggcaaatac tcttaaagga gagagtatcc acgttggatg 
226921 attaaattag ccaattagag agaagttatt tatccatgtc agcattgctc tgtcgacaac 
226981 aaatttggtc attcatatgg gcttctaaat ttcgaaatgg tccatgttaa gccttttcct 
227041 tccaagcttg atcgtccatg acctttacac ttcgtcttca ttgtttaact ctcagaaatt 
227101 cgccgttacc acgatcactc aaatcgattc aatatgttct ttactacttt cttcatgatt 
227161 tcgtttcacc tcctttgttt ttcttcctat ataaagcctt ttacatcaac ctaaacctgc 
227221 agcctccaca gaagaagata gagctggatt cagatttcat tcgatgaaat agttcaatga 
227281 gaactgcaac aggagagccg atctataagg taaatctaat atatagtttg ttggtaatgt 
227341 attattctct ctccctgtag gtgttagatt caattatccg acttatagct gagacaaaca 
227401 ttaacaccgt ccaaatacgc agattagcta atcgatggac aaaatctaca ggagctccaa 
227461 ctactactca tacttgacag caacagaagt cgcttgcgta catctcctcc tttacgttgt 
227521 tccgatacaa attccctacg gctcgggaat ctcgaggttt tgtattatgt attatcttcc 
227581 tcgggtcgtt cacaggtaag gaaatagtct cactgtttgt aagaattttg atctgatcac 
227641 ttcctttcag attttgaatt actccgcttt ttgaggacga cagagtttcg aattgaaatg 
227701 tgatggctcg catgaacctc tctcttaccg aacataaccg cgggagagat ccacgtcatc 
227761 atgaagtaga cttcgcaatc ttctctcaga aactctgtta ctctctgctg aaacttcccc 
227821 gacacgtctc tgcctccaaa aacgtcgaga acgtgaagat tgttatcctt cttgactgtg 
227881 aacgtatagt tactttgcto tctctctgac ctcagaggta agtaaggaac aactgtcttg 
227941 atctctatcg tcacaacctc tgtgaagtct ctaaccagta catgagagac gatgaagatc 
228001 gtgaaaacga tgagagatat gactgaggct gcagatgctg tgaagagtga tgatcgtgag 
228061 aggtttaacc gccggtgatt gaacatcgcc gtgaatttct tggcgatgac gttatccatt 
228121 ttggttaaga attaaaggta ggaggagaac tcagttttgt gaagtctgtg ttttgttgtt 
228181 ttcttatagt ttgttttttg cttggggttg aagctgtcgt gttctgaaaa gttcttaagc 
228241 cgccagagaa cgtagacttc atttttttat tattattatt aacgagtggt tctcaactaa 
228301 gcctatagtc tgattacaat gcaagggaga gaacaaaacg gtgttcatta tgtgaaaata 
228361 atgaatgtgt ctaacgattt caaaggccaa gaatcttggc aggttcctgt ttttctttca 
228421 aaatgtaaat caatcgtgag ggtgagacag ctggttactt gctgattttc gcttctaact 
228481 atgctgttga atatcataac tttagttagc aagactttcc gacatattta atatttagtt 
228541 taatcttgaa ccagtgtaaa atagttacct cttcttagat attgtctaat gtctttagga 
228601 ttgatgtttt cttactgcat tgatactttg taaaaggcga ctagaaagaa cagtgtggct 
228661 tatttctaaa tgttatgatc attatcatta ctgtctttta gatatatgtt tttgctttga 
228721 tattttcagc cactaatcca acgtgacgga ccaaagtgca aaagatgtgg aatctacgag 
228781 gaagacaact tttatatatg atccatttga cccttctcat tcaccgctga gagtaaatac 
228841 acacacgcaa ggacaacaaa tttattgcta cttttctctg ccgtggatgc tcacaagtag 
228901 gggctggtga gtttccaaga acagtatatc ttgagtttta cattcatttc tgttccttct 
228961 agctttacag tttaattatg gtgtaggggt gaatgtgatc attaaagata tcacatgctt 
229021 attttttgta gctgaatgca tccatacatg gaattgtcaa agctctcttt gaatcttagt 
229081 gtgggttcca gaaggtaaat cagtttttac tagacagttt ttttttttga agtttttgtt 
229141 tccacattga agactctcct gcttatataa ggtggcagac caaactatag tgtctaagtt 
229201 atatctccct caaggttttt gtagtcttct acctaaaccc atgtggaatc atgcattcat 
229261 gcctatgtat ttgttttcct cttagtttat aaaattttac acattaatct tatagagaaa 
229321 aaattcctgg cactataagt tggtaagtgt tatagagaaa ttttagaatc tcaaatgtta 
229381 ctttagatta gaaatactgc gtcatgtgtt agcatttgtt taaattattt ttaaaagtaa 
229441 ataaatgata caaggttcta gcgacaactt gctaaaccct tatttttgtt ttaacccagt 
229501 cgttgctaaa cccttatttg attagcggag acatggtcag caggaacaag gttggtagat 
229561 tcacaccttt gatctcagtt tccttagttt gcacataatt tcacagaaaa atccagattt 
229621 tctgattatg gattatacag gagtgtggca aaattcctgg agaaagcgtt tctggagcaa 
229681 agggagaaag gtgaagaaga aaatgcgagt ttgttacaga tttgcagagg aacaactcag 
229741 acggaaagcg caagacgctt ctacgagact ttggatgtca catctggttt ctctgtccgg 
229801 ttcaactttg ttggatttat cggttttcaa gtattggaaa ctctcttaat tgatttgaga 
229861 tgggcactaa cgtgttctag gtaaaaatct ttgccaagac tgatgacttt cagaatcagg 
229921 ctcactttca tgaagattat gcaaggagat ttaaaccaaa gatatttacc gcttctgaat 
229981 cttgccagag caagttgtag ttctttgatt tcgttttaat gaatatattt ttaagtgcag 
230041 ctcaacatat aaaatgtcat atttgttgga tgttgagcat tagatccata tagattttct 
230101 aaaatcgtat ttcatttgtt ttattttata tgtcgtttaa agttatctca cacatattaa 
230161 aaactacatt tttgttataa ccctaactaa tatactattt tgtctgtatt tgataattaa 
230221 taaagtaaga gaatatgaaa aagaaaagaa taggaataaa ctgaataaat atctacattg 
230281 aaaatataaa agacacctaa aatgacacaa aatttgaagt ttagagtgat aataattaat 
230341 actccttctg tatcactata atgtgtttaa ggatttcaat tttattccaa aataagtgat 
230401 gttcttttgg ttctagataa atgtagtttg aaatttgtga ccaattacaa aatactattt 
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230461 tttattggtt aaattaattt aatttaatat tattttatat aaccaaaata aaatattttc 
230521 ttaatcttgg tgaaaaacca taaaaacctt ttaaaatgat acaaaggaag tataaaacag 
230581 aggatataat gaaaaaccag acatgaaaaa aatacaattt ataaaaccgc atataaaaaa 
230641 tggaaactca gaattgaaca gagaaccaaa aactatttaa ataatttatg aatctaacag 
230701 tttttatagt gacaactcaa ataggtagag tgagaaatca aattatattc aagacttaat 
230761 tatttttttg ttatttctat ttgacatata ttttatatat taaaacaata aagtttagtt 
230821 ttaaataaat cgaaaaccat attattttat tttgcaatgc agtaatatta tacatgtata 
230881 tatatatata tacttacata tttgagttgg aaactcagaa ttgaacagaa aaccaaaaac 
230941 tatttaaata atttatgaat ctaacagttt ttatagtgac aactcaaata ggtagagtga 
231001 gaaatcaaat tatattcaag acttaattat ttttttgtta tttctatttg acatatattt 
231061 tatatattaa aacaataaag tttagtttta aataaatcaa aaatcatatt attttatttt 
231121 gcaatgcagt aatattatac atgtatatat atatatactt acgtagaata atatattaat 
231181 ttatatttac ttaaaactta tggaaataat ccgggcttag cccggaaaag gttctagtat 
231241 ttctaaaggg tcggcattct cgaagatcgg tctttgaacc aatcacgagg ttgctaaatc 
231301 ctaggtgtcc acacaaaaaa acgagttgaa gtcaaagtta ttgataattc aaagaaagtg 
231361 atttgatttt gaagtttctt cgttgaaaag aaaaataaat aaaaagagag agagaagagt 
231421 gagtgaagga gaagacgaca gacaagtttg cagtttcaat ttaaaggttt ccgcgtacag 
231481 aattaaagta tatcagatct attcacataa cttgttttcg atctgtaccg tttactgatt 
231541 tttagggttt ttggatcttc tgtttagcaa tggcggataa ggccgagaag atgaagctcc 
231601 ggcaagatta ccggaactta tggcactccg atctcatggg aaccgtcacc gccgacactc 
231661 cctgtaaatc tctctctctc tctctctctc tctcttttcg atttctctaa atcttaatca 
231721 aagaaattga tttttctttg tgcagattgt tgcttgtcgt gtgtgtggta agattagggt 
231781 ttgttgttgt tgtttctctg tataatttgg tacacagttg atatctcctt ctttgtttgt 
231841 tttattgctt gcttttgtag tggaccgtgt gtttcttact tgcttcggag aagagcactt 
231901 tacaatgaca tgtccaggtc tcaacttttt tttttttatc tatgtacatt attattattg 
231961 gtattgaaga tatttatttc tccctttgct aagatattgt gagtctttat tcttgtaggt 
232021 atacttgctg tgctggatac atgccctgta gtgggaggtg tggagaaagc aaatgccctg 
232081 aactctgcct tgctactgag gtctcacttc cattaataca tatatatcat tacattaaag 
232141 ccgcttgttt ttcggtgaac aaaatgaaac agcttacatt ttcttgatca ttcttttttt 
232201 cccctatctc tgtggcattt gttctaatat gagtactttt aatccctttt agacaattat 
232261 ttcctctttt tctgtgtgtt tgtactctcg tatttctgat tgctttgctc atttcatttt 
232321 gactgagaga cttacattat ctctgagatc catacatacc aatgaactgg attttttttg 
232381 gcttgctggc atgagtagtc aacttgctat tattcaatag tagatctctt gatgattcga 
232441 catttttttt ttaagaaact tgtaattgtg attttttagg ttttcctctg tttcggaaac 
232501 tctgtggcct ctactcgctt tcttctccag gatgaattca acatccagac tacaaaatgc 
232561 gacaattgca ttattgtact ctctttctct ccttctgttt taacttgttt tacacagaat 
232621 gcatatacac atctcattct catccttggc gttttgcagg gatttatgtt ctgcctaagc 
232681 caagtcgctt gcattttctc tattgttgct tgcctcgttg gtagcgagga actctctgaa 
232741 gcttctcaga tactctcttg ctgtgccgat atggtctact gcacgtatgt ttttactctt 
232801 ccttaattct tttcacttta tcctcttagt atattaaagg taaaaccaat gctaatactt 
232861 ttacgaccgt cttggtgtta tttttttttt ttcagggtct gcgcgtgtat gcaggtataa 
232921 gaaagcgtga actactaatc ttttagcttc tgaaatatat tttcttatgt tatatataaa 
232981 ccgaagacca atgttatgct tgcagactca acacaagctt gaaatggaca aaagagatgg 
233041 agtgtttggt cctcagccaa tgggagttcc accggcacag cagatgtccc gttttgacca 
233101 acctgcccct ccagtcggct accctcctgc gtcttaccca ccggctcaag gctaccctcc 
233161 tgcaccttac ccaccggctc aaggctaccc tcctgcatct tatccgcctc ctggttatcc 
233221 ccaacattga ggaactttaa accaatatct tcatattgtg tgtcaaagtg ttgttgttat 
233281 tactgttcag ctatttttct cagttatgtg gttgctttgc aagtattagt actctggttt 
233341 atgcactaaa acatttggta tcatctggtg tgttgacttt gttggtattt ttctcccatt 
233401 tatctgatga acgcaaaaat gtgtaaacct tgctatgcta tatgtgtggc tgtattcctt 
233461 ttatctttgc attatctgta ctgtgatttc tttgcaattg cctaaccagc ctgttcaagt 
233521 aggctactta acactgtcaa gtagcccaaa acaatatttc ggtattgttt tgggatgctt 
233581 ttcacgatga cctatgacca aaccggtaac atacaagggc ttgccaatta taaaagcaca 
233641 tgcgcaagcc tttctcccga agctgaatag atgagcggaa tactgatcct gaggactagt 
233701 catggcatta ataaccacaa gttggaaact aatgtttctt tcttctttat tttagttata 
233761 tctgttggca ctctgctttg tattcctaac atgagaatcc gagtagtatt ttttattttc 
233821 tcccgaacct gatttgtcta taaatatctg ttctttataa taagatattg cgcacgaggg 
233881 tgggacgtgg aatcatcgag gatgggctaa attgcttatg aagtttattt aagtcttgta 
233941 aaaatcttga aaatattttt caaaaacaaa aaaaatcttg aaaatatata acaaagctaa 
234001 tgattcaatt catactgcct tttacgcttt taagtcaaag tgggtcaacc ataataacat 
234061 tggaagagtt gtctatcgat taatttaacg acttctccca ctatatttag aaaattcagt 
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gtagtacgtt 
tttcaaaatt 
gataaaataa 
ttcttcaaga 
tttgcaattt 
tgttacttca 
tttcaacata 
atgagtagaa 
agtccatgtg 
catgcgaagc 
aattatattt 
gtctttcagt 
agcttaccaa 
tcgtaggccc 
attctttcat 
catacaacta 
atacttggaa 
ctttatttgc 
ttttgagtgg 
ttgtcgttct 
ttaactctga 
tcatggatct 
taaatttaaa 
gcctactatt 
atattacaca 
attacatata 
ctgggacttg 
accccaaact 
gaaatattac 
ttttctttct 
atttattttc 
ttctgaatag 
tttttaattt 
agaaaataac 
gattgagcaa 
tccgtttgaa 
taatccataa 
cgtcggatct 
ggtcggttta 
gaaaacaagt 
gaagatgtaa 
gagattagtt 
aggagctcgg 
agagagagaa 
gtatatatag 
ttattgctcc 
tttatttagt 
agagtaacta 
aggatgagga 
attttttaaa 
aaagttcaaa 
ttttttaatt 
gtatttagtt 
ttgtttagat 
gttatttgaa 
ttgcttgatg 
gcatagacta 
ataaagtatt 
ttcgttaggt 
tacatgcatt 
taaaataatt 



tgaagtgtaa 
taatcaaggg 
tagaatctac 
tctaaaatta 
agcggcggaa 
tggtttcaca 
atgtttttat 
gcgtgtgggg 
tgcgacgtgg 
ttcttcgcgt 
aataccccaa 
tcatagaaaa 
gatggactat 
cagattttag 
atataagagc 
tacaaagcga 
aatacattgt 
ctatttttgt 
agagcatata 
ttgactttac 
gataagttca 
tttatcagtg 
aagttaacgc 
tgacactgga 
tttttttaac 
cgattcgaca 
aaatctggat 
tgggtgtaga 
acatttatta 
caaaataaag 
acggtaacag 
atgatacaag 
acatccatca 
ctttcatgtt 
gaagttcaga 
gctcatcagg 
tccaacggtc 
catcgtcgtt 
gatccgagaa 
gagagtcatg 
ccggttcggg 
ttcgtatcaa 
aatagcccac 
atagagagag 
ttggcgtgtg 
gttaatttaa 
ttcttttcaa 
tatatttata 
ggttttggag 
agtattttcg 
tttgaaaaca 
ttattattat 
ctttaataaa 
gtcattttct 
atgctttaat 
aggatatagc 
ttgattgttt 
aatgtaaatc 
tcatttcatt 
tatatttata 
ttttaaaaaa 



agaaaaacta 
atgtgctagt 
aagttgatat 
cattagtata 
aattggtatt 
atttgccatt 
agattctatt 
ctcgtattag 
cgacagcata 
cacgctctct 
tgattaagtg 
ttctacaaaa 
gattaagctt 
ttcaaacact 
gtgaattaca 
gaattataag 
ttaccaaatt 
ttttttggag 
gcctcttggt 
aaatctttaa 
tctattttcc 
aatagttatg 
gtgatttaca 
tgttttttaa 
gctgatttat 
actgacaata 
ttgagaaaat 
aaactttaga 
atttgttaat 
ttgtttttgc 
aaaaaagagt 
ttttcttctt 
gctaccaaac 
accctgtcaa 
aacgtcttca 
tacaaactgg 
caagcattct 
ctcgaagtcg 
tctgacaacg 
gtggcttgag 
ttccaagaaa 
acctaagaaa 
cggaaaaccc 
agacttttgg 
tatgtgtata 
tggagtactc 
taaaaaagtt 
tttttagatt 
gtgtttttgt 
aatttataaa 
cataattcaa 
ttatttatat 
acttattctg 
tctttaggaa 
ttgtttctgt 
tacgaatgtg 
ttgttcatac 
cagaaataca 
ctcagtttgt 
ctttagagat 
aaaatttacc 



acttattcag 
atatgctcca 
ttcttcacaa 
ttgttctggt 
tttttaaaat 
ttttgtgtac 
tatttttgtg 
tttagaaata 
acatattcca 
gccttcatac 
ttgattctta 
ataaaagcaa 
cgctagaaag 
tacgattaaa 
agacaaatag 
aaaattgatg 
ttttttttca 
aaattatttg 
ggaggctgcc 
cttaactctg 
gtgaaccaca 
aaaacctcaa 
atttgtagtg 
gatttcttga 
tatgatatta 
ctcatagttg 
ctgcaataaa 
ccttaatcat 
gagcaagtcc 
ttagataaca 
gcaaataata 
cctctttcac 
acaatactac 
gcgagaggaa 
ctagattccg 
gtacgacaaa 
ttatggaaaa 
tagagacaca 
ggcaagatct 
gtggtggagt 
tcgggtaaac 
gagagtacat 
attgcttcgc 
attgtgatga 
ttaatattgt 
aataatgtga 
ccaaaaaaaa 
aattaattta 
aaaaaaatat 
aaaaatgaat 
atttataaaa 
ttatatatct 
gtcatttctt 
cttttttatt 
ttgcgtggtt 
gtaatgatct 
tccctataaa 
tcaaataatt 
cgacacctat 
tgtttttggt 
ttcaaagtag 



gtatatgtag 
tatcgcagga 
tggtaaggta 
ataacatgtt 
tatagttaat 
ttcatggtat 
gtatatctga 
aataagcaca 
atttccaaag 
tcttttgcag 
agaatgcaaa 
caatcctatt 
tttaatatgt 
aatcgtataa 
agatgatgaa 
aaaaaagttc 
agtttatggt 
cctatattag 
tccatcgttg 
agataaaaaa 
tgtgtcaaaa 
aattttcttt 
agaaactagg 
caattttaat 
caattacaat 
cctaataaat 
ttgcatagtc 
tatgctaaga 
aattatgttt 
aaaaacctgt 
actcaaactt 
tagcctgtaa 
taacaaaaaa 
aaagaactat 
accaaagctt 
gcggacacgt 
tatgcctgca 
ctgcacaaca 
gtccagctag 
ttggtggtgg 
caatgaacca 
gaatgaagat 
cgttagagag 
aaagatggag 
atatgtgatt 
aaattttcgt 
gaaaatgata 
aactgactgt 
taaaaatagt 
ttgtttcgaa 
acattcttta 
ataaatcaag 
cttttctttt 
gacaaaaaac 
ttctttggta 
ttctttaggt 
gataatagaa 
aaggtaaaga 
attgcattat 
taccgtgaaa 
gatattttca 



atttatcata 
aactggtact 
aaatagtcga 
tagttcgcct 
tatttattta 
tgtttgaaat 
gagccaaaaa 
cgcacagaca 
tttcccaaca 
gaacctctca 
agattaacaa 
tcaagtatac 
acggtattaa 
aaacacccaa 
aaaaaacctt 
ctttacttag 
ttttgatgca 
tccttttaac 
attcgttgcg 
aaaaaataac 
ggaaatagct 
tgctatattt 
tagcacagaa 
aaaataggaa 
tattaagaac 
cgcactctct 
tgagaatcaa 
tgcttccaca 
ccatctaatt 
gaatgaacag 
tagtttcctt 
ccactttcta 
gtattaacaa 
gtcctaagat 
ttgactaaag 
catctgacta 
attcgtcagc 
ttcggattcg 
cagcgctgct 
gtcgggccat 
gaaaagtgtg 
ttttgggagt 
aaaactgtaa 
agggtgggat 
tggtaattaa 
aggataatct 
tttttatgaa 
ttaggggtaa 
ttttaaaagt 
aagttattaa 
cttttttatt 
aggtagaaat 
tctttttttt 
tttaaaaatc 
tttaattatt 
tatagtatat 
atcaagaaat 
cagatcgtta 
acatgctatg 
ttaggatctt 
catcaatgta 
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237781 gattacaaaa aaaaaataaa gaatgaaaat tagattctac tctacaaaat caatgaaagg 
237841 acatcaattc actgtactag aaaatcttaa ttaatcgtca cattttggca tggacaaggc 
237901 gaaaatatat ttatcatacc aagtgtattt gatatcacaa tgatgaatac acacattgag 
237961 ttcagtccaa aacaatgttt gaatcttcac atgactagta gagatcatca gtctacatat 
238021 acgtcatttg ctattatgat tcttcatcta ttgacttttg atatgtattt ttatgaccta 
238081 aggtgaacta cgtagaaaag cctttgcagt tctttaagtt tgattttgat tcgtacttct 
238141 agaatacaat attaatttac gtaaccaaaa tatttgaaga tctatttcta gagaggggga 
238201 gtcatatggg tatctgactc aaatgtatga aagagctaaa gcagctctga cccacctcac 
238261 accctcccca tcttctcctc caacatttca cattccgcca ttcaacactt tttgcaacta 
238321 tatactgtat acactcacac acagacctta tatatatata taaggttgtt tattcatcta 
238381 tgtaacttga acaatatttc aatgaattaa aaaattccca atttaattgg atttatatgt 
238441 ttgataaagt agatgctttg tcatactttg gaaaaattga atccaccaca tgtagaaggc 
238501 gtgagaggac aaaaatcacc aggctatttg tctcccacgt acgtttatgt acagcaagtt 
238561 taccacacac tcatgtttat attacatagg aacaagggct tacagaataa ttaaaaagcg 
238621 tcactgttag taagattata ttaactgata agccagataa atccagccaa gattaatcct 
238681 ttttgtctgg acaattaatt taacaaccaa aattcggtta agaaaaacac ggaaaatcaa 
238741 ttattttttc tttcaaatta cttaggaact gcagtgttca aaatttatag tttataccct 
238801 ggcaatttac attcacacat aagggtttag gtaaggtgtc aacaaagttt taatgatcgt 
238861 tgatgactga tgtgtcatgt gtgtacgtag aattttgatc catttttagc atactattat 
238921 ttttttgtta acatattttc ttcagtctac atgcaaatat caaaactcta aaatgaggaa 
238981 aagtaaataa tttagaaact tcaaatatgt tagaaacttc cggtcctctc gtaacagacg 
239041 agttagtaac tgtgatatat atactctatg gattggatta tgacagccct tagtttctct 
239101 agtttctctc ggttattttg taagttcata tattcatact tatcaatcta aaaccgtcaa 
239161 tatttgcatt ggtcatctgg cttgaaccaa tactggagtc actttgtaaa atgttaatat 
239221 catatgcact gttgtactat gtacatgtaa agagtttgtt gtatatattg tattttatgt 
239281 acggatatta ctatagtcat gacagcatgc tttgcaggat ttaaattatt ataagaaagg 
239341 gaacagtaaa aagtacttga ggaattttgc tttaagtttt aaaagtagag ttaaacatat 
239401 ttaacttgaa ccttgtctgc taattcaacg gaaaactaca aagtttattc cagttttgca 
239461 tgtcagagac tgtcgttttg tcagcaagcg accaactaat cacactaatt gttttaagca 
239521 accagctaaa ctataattaa agtatccaaa tatatatact tgtctttgtt tatattcaat 
239581 aatcaattca atatatacaa ccaatttcat agttatccag attaacactc acttgactct 
239641 tagatcatta ttgtttgtct attttctcag atatattttg gtttagttta ttcttttttt 
239701 ttggttgaag aaactgaaat ggtgagaagt gaggcttgat ccaaggtctg gtgggagagt 
239761 ttttttactg gggaaacacc aactgatcta atgaattttt tttttggttc agtatattca 
239821 tgtgaatatt cctaattttt gtccttgtta tatttttaag aaacttttga gctattccga 
239881 taatgaaatt aatccgaaaa tgaaattatt ttataactat tgtaattatt ttctcagatg 
239941 aacaagtcac actatgtcaa tgattggtca tgtagactga cttggaccaa gtggcttttc 
240001 ttttctgaat gcatttattt ttctctttta atttctttga atgaatttaa gagcatcttc 
240061 aagaacaact ttatatttga agtttgctga attttatatt tgaggtttga tggtattttt 
240121 attcactagt aaactctaat ttttatttat tttatatttg tacatcatag tctttacttt 
240181 agacaaaaat caaataaaaa cataaaactt tataagaact aaaatcatag aataaaacta 
240241 ttctaagtaa tatttattaa taaatattac attctactaa agattacaca taaataaaag 
240301 attttaaaat acatgactat tcattattat tgttattatt ttcacaatgt tcccacatat 
240361 gatcaatcag ttgacatttt agttttatca aatgaacaat tccttgattt cttaaaataa 
240421 agtgcatcag ctaggggaaa attatgatat aactcaatta tgtatgctaa acaagtaaag 
240481 aaactaaact gtaaagaaat gcatgaggat agtaaaatgt tattaacaaa caaagaatct 
240541 atcgatgatc atactacttg tgagtttatc gatctgaaca agaaacattc acaaagggaa 
240601 gaactcaagc gtaataacaa cctccaaatg aacaaaaatt ttatggaatt ttttttggtg 
240661 atactagagg tagagtgata atttatcaac acaacaacaa gcaattcaag ctctttaaaa 
240721 aaataccaga aacaacaatc tagaaaatga gttcgaaact tgaacaagac aacatcatca 
240781 aacaattagt attgtcatgt tttaaacaac ttcgaaataa taaaatgaac aaaaagatgc 
240841 gttagtgtgc tcgtgttcct tgagaacaaa tggactttga gaaaattcaa atacaattgt 
240901 agcaaactgg tcgtgaaatg tattgttagg cttattaggt accaattcaa tcgtatctgt 
240961 gatcttttat ccttgctttg acacaattag aaaccatttt caccaatcgt tgtgattcat 
241021 aatcttcttt acccaatcga tcctccttct ttctcagcca aatgagacaa gctcaaccat 
241081 tagttcaata tcgatatttc ctataaacac acagaaacac gttgcagtca cacacagtaa 
241141 actctaccca aatattttct tagagaggac atcaaaggtt ctcaatctca agtacggttt 
241201 tgaagctaat atatttatta acaatgcttt gcttaatatg gataacatca aaacagtggt 
241261 tatatacgag aaagaatgta aaattaagct aagctcttgg ttgacccttt gggtagaaat 
241321 attcaacttt atgcaaagtg ttatcagagt atctgcgcct tgaccaaccc taaacacccc 
241381 gactctacta ataccatcta gagtagagta actaaggatt atcttattat gtagtaaatg 
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241441 ttaatactac ctttgacctc ctctggctca tatttttgca gccttagaat gctgattttc 
241501 cagaactcaa tacgatctca accagctgaa tgttgccgct cagtaaagat gccttgcgag 
241561 atggcaaaaa agagaagcca tccacaagtt taacgcactt cagaccccat ttcaccatga 
241621 ggaattcgat tagatgatgc ccgctgcagt ataaaaatcc atacacaaaa atcaccatcc 
241681 taatcttttg cagatatgtt atatgacaag gatgaatgaa ttgtgtacct cctgttatga 
241741 tacgtggtga taaaaactgc atctggagaa ttttgaagca ggaaagagac ggtggcaaag 
241801 agatcatcga acgctatata caaaacaaag gttcgcgatt cacatttcag ctcatggtaa 
241861 caagtaatct gggagaagta acgactccac aagggaaaat tagaaggcca aaacaaaaag 
241921 gaaacagatg aactaaccac ttgaatcata tagaacatct gctccaagta taatgttagg 
241981 ccgcagatca aatatggttg catcccacac tccccaagtg agacccatta cctgccaata 
242041 gaccaccata aggtatgttt aaatgaatgg agaaaacacc atgagacaaa tatcttggag 
242101 atacaaacaa acatatacgc tacgctccta agcgttatat atatgcgatt aacgattatt 
242161 gttaatccag cgggcgttga agttgaagaa agtgaaaagt agtaaacgtt acattacagt 
242221 tgagattgtt aagctcacaa actcttgtca tattctccaa aacctgcaaa tgtatcatgt 
242281 atggtgaaag atgagtaata aactcctagt ctctgaacca aatgagaaaa caaagcttat 
242341 caatacattt tcacacctct gttttgcttg agtcatcggt aagggtgaca ttagctccaa 
242401 ccttagcagc tactaaaccg ggtaaagaag tgccagctcc tagctacaaa ttacatcaaa 
242461 cacataaatg tcttaagaaa agaagtatat aaaagtcaaa aaaagacatt cattagctca 
242521 aacacagaac acaatgatga cataatccca cttctttaat gaagaaaagc taaaaaaaac 
242581 aaaacattca ctcaaatctt tgttctcgat ttcaattgag aaagcaggag tggtttgttt 
242641 acctcgagaa ctgaagagcc acgaaatcga gatcgctctt gccagacgta ctcggcgagg 
242701 atgacgctac aaggccaaac gaacaaacca tactcttcct tcatattcta cacacacaaa 
242761 aaatgtcaca agccaacgat gcaaaacgaa aaccatcaga agaagaagga tagcttcata 
242821 cctcgataac agatacagag atggaagttt cggaatctcc tccatcatcg ccatagaaat 
242881 catgctgaga gatggtggta gtagtcattg tatattcttg agattttgaa gttagttttt 
242941 ttttttaatt taagttgagt ataagggagg aacggagaaa cgtcccacat cggtaagaag 
243001 tgaagttgag gacactattg tttgggtata aaaaggcttt gggtgggtta gagtaaaaac 
243061 atgatccttc aggcaagtga aaggggatat gatgagtggt aaatattctc gctttatgag 
243121 agaagagcgt ttcgctccgc ctgttacgac aattgagcgc tctcctttgt cttaccatct 
243181 gatccctctc ctcattggtc actttttttt tttcataaaa caaattttgt ttattaacta 
243241 taataatagt tattgttttc tttgaaaata cattaatatt tttcaaatat acttattaga 
243301 taattacagt ttatattcgt tttagtaaaa aaatattgga actcttatta tttttagtgt 
243361 ctttttactt gaacaacctc tatttgtctt accatcttag attcaaagct cccacgctca 
243421 ataatgtcga agtcgtggga actgctgctg aattcgaaca ctatctctct tctcacttct 
243481 gatgggtcaa aatctagcaa gccagagcca aatgtaaaac aaatgttctg ttcgacatct 
243541 gatttgatga agatgagata aaaggaataa gtagatcatt agaaaacgtt ggacgaaaac 
243601 tatctagtta tcaaaaccga acgtgcacat caaagactta agtcaaaatc tttctggagc 
243661 aacctcaact tctttggaat tttgttgcaa gatcagcatt agaaaccaga aggtaggcta 
243721 agacatagca acaagcatga atgaagaata atgaaacgca acagaaaaga gtagttttgt 
243781 cttaaaaaaa aggagaagga ccataacaaa cggtttgaac aaaaaaaaag accataacaa 
243841 acggaaagtg ctcttattat gctgcatcaa caaagaaaac acaaacaata ataaaactct 
243901 aaaaactcta aaaactcaaa ccagaacctt gaagtttttc tttgatgatc atgtctatct 
243961 ttctccattt cagacgtaag ataatttttc gagtcaccga cttctccttt acggaaaaaa 
244021 aacttgtgct ccacattgtt tattgtccct gttttgttag cctccaaact gctcagatta 
244081 cgcagagagc agaggtccaa gatcccgtcc acagatccac tatcttcttc ttgcttagta 
244141 aaaggacatc ccaagaagtc cgcaagtctc ttgatctgat cacgaggctc ggctttcatc 
244201 tcctcatacc tcatgaaaag gacatgcttt gggtcttcca agcttcctct ccagtaactc 
244261 aagagatgat cccaaaaagg tccaaaatag ttggttccat cgcagaactc gttaaacata 
244321 gactcgagaa ctcttctggt tggttcgatt ttatatatag cacagctgta aaaccaacac 
244381 gagatcaacg tgtccttcac gttcctgcac acgtacacaa tcttgcaagg agagtgcttg 
244441 agtgcttcgt ggatcgtgtg cagtggcatg tgagtcgaga acagcctcgg aggtgctgag 
244501 aacttggcta ggttaggact tgagctttcg tggtacacat cgatctccaa gaatggtata 
244561 atgccatgag gattatgata taggagagga tgatcagaag agtggttctt tgatctctca 
244621 agcagagcga ctgtgagggc cttgagccaa gtagtgcctg atttggggta cgaagcaatg 
244681 attacatcag tgtcttgcgg ttgaaaacct ctctggaaat tgagacacct tggagggtgt 
244741 tgtaataata ccaacatcct tgatatttac aaatcttatg cccttggtaa tctgtgtgtg 
244801 aaggaagcga agagatcacc ttcctggttt cttcacttat cttgtcgtct ctaaggttrcg 
244861 acggaagctc cttcggatcc atttgtgttt cttggttctt gacttcttgt tactcgttac 
244921 aaaaggatat gtttaagttt caaacgaagt tgtaatagat ataagagaag aaattaagcc 
244981 agaaaacaga tattgctggt aataaagacg aaacctaatt tattatttta tcttgaaatc 
245041 ggtggtggtg gtctagtggc gcttgagaga attaaaaaat ccaatacagt gaattcgatt 
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245101 
245161 
245221 
245281 
245341 
245401 
245461 
245521 
245581 
245641 
245701 
245761 
245821 
245881 
245941 
246001 
246061 
246121 
246181 
246241 
246301 
246361 
246421 
246481 
246541 
246601 
246661 
246721 
246781 
246841 
246901 
246961 
247021 
247081 
247141 
247201 
247261 
247321 
247381 
247441 
247501 
247561 
247621 
247681 
247741 
247801 
247861 
247921 
247981 
248041 
248101 
248161 
248221 
248281 
248341 
248401 
248461 
248521 
248581 
248641 
248701 



caaaccccac 

aagaatttat 

cggatttaat 

accttaaggg 

taaaagaaac 

aagtggaaca 

ttcatttata 

tggattttgg 

tttagtcttg 

agattttggt 

aaacaataat 

aatatattat 

ctccaggaaa 

tttaaaagta 

ttgtaaagat 

atttaaaata 

cttaaattct 

aactccaata 

aataaatgta 

aatgtttctt 

atttttaaaa 

tgattcttac 

atttactaaa 

aaactttttt 

attttagaga 

aaccaaaaac 

ctattacgta 

ctactattta 

ttttatataa 

ttcttactca 

cacaaaatat 

aatatattat 

acttgttcaa 

aatgtatttt 

tagaatatct 

taaaaaaata 

cttttttaaa 

aattgtttaa 

ttattaaaca 

acttactgta 

aaacttgtag 

ataatatatg 

agtcgtgaaa 

gaactagacc 

caaattgata 

catggacgag 

gcccaaaaat 

caacattttc 

ttatggaaaa 

aaacgtaatt 

aatttcctgc 

ccaaaaccga 

agaaacataa 

attttccgcc 

caaaacagaa 

ttaaatgggt 

tctcatttgg 

cccattcgga 

ccgattgggc 

tacttgtgat 

ttaattatga 



cggccacatg 

ccgtggactg 

tttttttaaa 

cttagattca 

gaaatattta 

aacattacat 

tattatgaaa 

tttttgattt 

attttttttt 

ttttgttttt 

atacaaatta 

ataaataaaa 

atatatagat 

tatttaattc 

acataaaatt 

aaaaacaaga 

atagtatagg 

acttaattcg 

actctttcaa 

tttgcaaaat 

attttcaatt 

taaactttat 

aaaagtgaca 

ttggtttttg 

atccattttt 

tatttaaaaa 

tataattaaa 

tggtaaaata 

tattaacaag 

tatgatttat 

tcaataaacc 

aatctaaaaa 

tgaaaatttg 

atatggtata 

attaaatgaa 

ttgatcacaa 

aatttgataa 

tttattttga 

ccataaaagt 

agaagactat 

gttgaaactt 

gaaacttgta 

tcattataaa 

atgaccttag 

tggtctgaaa 

cccaattata 

ctaaaatatc 

cttcctgaag 

cataattttc 

tttcgttaaa 

caagatcgaa 

aaaaccgaat 

tttctcgcca 

aaaaccgaaa 

aaacacaatt 

gtatgggtcc 

acatggttct 

catgcccaaa 

ggtgtttatt 

acattcagag 

aataattggt 



gatatgggtt 

tacatccatc 

aaaagaaaaa 

tgttaagaaa 

aatatgaatg 

atatatgtat 

taataaaaag 

ttggcttttt 

tgtagatttt 

ccagactaaa 

ttattattgt 

taatagatat 

agttaagatt 

taaaaactat 

ttgcaattca 

atattataaa 

atattaatta 

tagggatgat 

caaaaataaa 

aaaaagttaa 

tttactatat 

atttaattat 

aatattgaaa 

tctgaaaaaa 

ttaaatcttg 

ccacaaataa 

ttggaatgtg 

attaaaaata 

gagatatatt 

aaccatttgt 

attgatcaca 

atttaataaa 

aaactaaaaa 

tagattaatt 

acttcatatt 

atctttaatg 

catataccaa 

ctattatatt 

cgtgaattca 

tgaagaccat 

actgtaagaa 

ggttgaaact 

tatatggaaa 

agatgtcaaa 

tgaacatgcc 

acatggacaa 

taggtttagc 

cctgaaagtt 

cgctaaaaat 

accggaaaaa 

aaatacaatt 

ttcctgtcaa 

aaactaaaaa 

aatacaattt 

tcccgcaaaa 

catggacagc 

atttgggctt 

cccgcccggc 

taatttttga 

gcgtatctac 

tttgcatagg 



attgctttcg 
cggaggttag 
aactaattta 
atatttaaca 
ggttaaagtt 
actattttta 
gtgatgattg 
gcttttgatt 
actttttcaa 
ataagttatt 
catccttata 
aatttataat 
ttaaaatgga 
tttaggaaaa 
aatattttaa 
tgtatagtaa 
attaactata 
tattaatttg 
tgtatatttt 
aaataaagta 
attgctttca 
atatttaatg 
accaacgtaa 
cagttattga 
attgaaaaga 
tcatcctcta 
cacataaatc 
tttctttatt 
atattaaaaa 
attattgtta 
aaatattcaa 
aatttaaatt 
attatgtatt 
taaacaattt 
tatattgttt 
tggaatttta 
aaaattaaaa 
ttagcgagta 
ttataaatat 
aaaagtcgtg 
gactattgaa 
tactgtaaga 
aacttgtagg 
tgggcgggct 
cattttgatc 
tattgggttt 
aaattggaaa 
tgatttctaa 
gaaaaacgta 
catagttgcc 
ttccaccaaa 
aatcaaaaac 
cgtaattttc 
ctcgcaaaaa 
atcgaaaaaa 
ccaacaaatc 
cgaccaaaaa 
ccgcccaatt 
tttacacaaa 
tataggatta 
atgccaaaaa 



actcctttga 
gtctgtgtct 
tttcgctggt 
tattgaattt 
tggtgcataa 
atacaattac 
tttgaataat 
ttagaatttg 
aaaacttgaa 
aaaataatat 
aatatattaa 
aaatataatt 
agaaatatca 
aaatataatt 
cacagaaaaa 
ttaaaactaa 
aaattaatct 
ttctattcat 
atatagtaga 
tattttaaat 
tattatatat 
catgaattta 
ctttttttta 
aaaaaactat 
atagtttttg 
aataattaag 
aaattaaaca 
tgatatggta 
aattgatttg 
taaaataata 
tgtagaactt 
taaaatatta 
ttatatggtt 
acaaaaaaat 
tataatcatt 
cagttttagt 
ttttgttata 
ccatgaagtc 
atggaaactt 
aaatcattat 
gttgcttaca 
agactattga 
ttgaaacttg 
ataaatgggc 
cagtaactaa 
aaccacttgg 
aaacacaata 
ctgtttattt 
attttccgct 
caccaaaacc 
accgaaaaat 
gcaatttccc 
tgcaaaaatt 
ccgaaaatgc 
tgaaaattat 
atggtcttat 
tgtccaacaa 
gacatctcta 
atgatgtatt 

ggggttcaat 

gaataagcag 



atgtccagga 
ttaataaact 
gatacgaaaa 
cattttctat 
tatttaaaat 
ttattttgca 
ttttggattt 
gtttttaaat 
tggtaaactt 
atttctttta 
aattaaatta 
ataattatat 
ataacggaat 
gtattttaaa 
taataactat 
caaaaattaa 
atctaaacat 
atttttgtag 
tgtatttctg 
aactaatttt 
gttaagcaaa 
gtaaaaaaat 
gaaaccacaa 
ttttttttac 
gaaaaaaaac 
aaacctgtaa 
attttattta 
tataattata 
ttacatttaa 
aaccattgat 
ttataatttt 
atttcttaat 
ttcttatcat 
atattttaat 
tatcttatta 
aatttgtaat 
taattattgt 
gaacaaatag 
gtaggttgaa 
aaatatatgg 
tcttgcatat 
agaccataaa 
ctgtaagaat 
ggcccgtgtc 
attttgtcca 
acaatgggcg 
tttttttttc 
gtgatttttg 
aaaatcgaaa 
ggaaaaatgt 
gcaatttctg 
atcaataccg 
gaaaaacgca 
aattttttgt 
attgtttaca 
ttggttatgg 
aaaaatgaag 
catgacctgc 
tgttaatagt 
tgacatatgt 
ctaaaatggt 
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248761 attaatgtat gtattgacac acctaaatct gagttcacat atctaaatta tacttttgtt 
248821 atcttatttt taaaatttta catatcatga tacatgtaaa aaaaattgat gggtccgcca 
248881 ttggatacat tgagttcttt ttgaattaaa aaaaaacaat caaatctgtg aaaaatattt 
248941 tttattacat agaaagctta aagtgaacaa tgatagatta tatatgaaat atataaatca 
249001 atttagttaa tttaaaatac attgaaataa ttgactttga gtcatatata aatcaattta 
249061 atcgactcat tgttattaca ttttttgtta gggccatagt gtttt agate aaattaagtt 
249121 ggtctattgg ctcatgactc ccctctctta ttctaaccga cagttttgtt ttgtaaccgt 
249181 tcggtgttta gaaaacaatt gtgaagaatg aaaaaaataa atatctcgac tttcggtgtt 
249241 ctcccatgcc gatgacaatt tcaaaactca ttaatcgagg gtctaatgtt tgatagtatt 
249301 tgtgaatctg atgettttte tttacgaatc aaggtatgga gtaattaaat ctattttgaa 
249361 ataattggtt tatttttcat tttgtttcaa agtatattgt ttgcatgtct agttacacct 
249421 agttaattaa taaatgtcca taacatttat aaagcaaatc aattgtttcc taaaaataat 
249481 gtagttagca agaatatagg caattgcccc attcttaact atttacttaa tcttatattt 
249541 aatattgagt actctgeaag ttcagttaaa cagattttac tggaaagaaa acatagataa 
249601 tttattcata agataatttt ttagatttat tgagacctaa ctctatgtga ggtgaataaa 
249661 cattggttta gatgaaaagg gaaattcaat tttatcttga aattatttta atagagaatg 
249721 gaagttatat tttattttac gacaataaat aatttcgaaa ttaatatcct agatcttttt 
249781 ttttgataac aaataaataa ttttggaatt tatttaatac aagagaagtt aaattttgtt 
249841 ttaggacaac acattaacta aactgaaaaa gagaaacatt aaaatagaga ctatctttaa 
249901 ataaagagag tgataaagac acatttaata aactggttaa aacaagcatt tttttaggaa 
249961 tgttattaat agggtctaat gtttgatagt atttgtgaat ctgatgtttt ttctttacga 
250021 atcaaggtat ggagtaatta aatctctttt gaaataattg gtttattttt cattctattt 
250081 caaagtatat tgtttgcatg tctagttaca cctagttaat taataaatgt tcataacatt 
250141 tataaatcaa atcaattgtt tcctaaaaat aatgtagtta gcaagaatat aagcaattgc 
250201 ctcattctta attatttact taatcttata tttaatattg agtactctgc aagttcagtt 
250261 aaacagattt tactgggaag aaaacataga taatattatt catacgataa ttttttagat 
250321 ttattgagac ctaattctat gtgaggtgaa taaacattgg tttagatgaa aagggaaatt 
250381 taattttatc ttgaaattat tttaatagag aatggaagtt aaattttatt ttacgacaat 
250441 aaataatttc gaaattaata tcctagatct ttttttttga taacaaataa ataattttgg 
250501 aatttattta atacaagaga atttaaattt tgttttagga caacacatta actaaactga 
250561 aaaagacaaa cattaaaata gagactatct ataaataaag agagtgataa agacacattt 
250621 aataaactgg ttaaaacaag catttttaag aatgttatta atagggttag tggcatgtaa 
250681 gtataattaa cattgaaaac ttatgggcat ttcctaagtg tacttctctt ttaataatag 
250741 agatattgaa gttgettaca tettgeatat ataatatatg gaaacttgta ggttgaaacc 
250801 tactgtaaga agactattga agttgcttac atettgeata aatatatata tatatatata 
250861 tatatatata tatatacgag gtaataattc atgatgtggg catagttaaa taattaatta 
250921 gattatttat tttacgttgt attaaaaaac ttgtatagtt ttttttctag agttagatat 
250981 taaaatataa ttcagatatt tacaaatttt tgtttgaatt ttgtttgaat ttttttcaaa 
251041 teattgeagg ttcggttagt aggttcggtt agtgtttgag tctaggtagt catcttagtt 
251101 atgtattttt tttaacaaaa atccaaatat atttaagtct tcaaatttga aaataaaaaa 
251161 atataaaaca taaaagttga ataatgtaac actaaacact taaatttaca taaaattagt 
251221 taaatttaaa catttggata aagaacaaat ggatattttt agtattttga atttttattt 
251281 tagatattta cttgtgattg ctttgataat tttgagatat tttcatattt ttgaatatta 
251341 tagttacaaa ataaaaatta tattatatat agtactgttt gttctctcct agatgeggea 
251401 tgtcatcgcc atgtcagaaa gtcaaaccct gtaatgttga tatgtgtcta gecagagtaa 
251461 aacacaacac ctcatttatt tggagtttcg tactgggctc agttgttaat ttgtattgtt 
251521 cgggcctgta aaaactgata gttgggctat gtatcagtaa gcaataatcg tectegttat 
251581 cgacatactc tcgatgaatc caagcgttga acatgataga aacggcttaa tccaagtata 
251641 gtegcaaett etttaaeggt tttaattcaa tgcacctttc attatttctt ttgatctttt 
251701 gagtgaaact tagactagat aagtctcatc ccatgaaatt catcagttcc attttctcaa 
251761 tctttgtaaa ctttcagcaa taagaatggc taaatctttt ctattcttcg aagatttgac 
251821 ggctgttggt caagtatttg ttaagatact catgaagtat tcgccaccgt ttgagattga 
251881 geegcatttg agatcggttt tggttgagta aagatgggaa ctgaagattg aacggaagaa 
251941 ageggecttg acegtcaaag atggtgtcga gaggattgta gagtggagga tggggagagc 
252001 catgatggat cagctggagt gttaggtttg gcaagtgagc tcgagacata ttgtgcaggg 
252061 cactagaaag attcaatata tagtgattat ttagtgtatt tattgtatat atgtttttta 
252121 tcttctaatt tgaattatgt ctatattaaa atgtataaat ttacaaaatt tattagtgat 
252181 tactttattt ttaaatctct aactaaaaat taaaggttga taaccccgta aatetttcta 
252241 aaaccatgtc atctgaaaat aaatggtggt gatgatacgt gtacaaattg ctttacaaat 
2523 01 gattgttagt actgaagctc taaaatgtaa actcgtacca ctaaactaaa tccaaacaag 
252361 tttagcgtga cgattatgtc aaaggttcaa ctctcaacaa tgtataattt gcatttattt 
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252421 
252481 
252541 
252601 
252661 
252721 
252781 
252841 
252901 
252961 
253021 
253081 
253141 
253201 
253261 
253321 
253381 
253441 
253501 
253561 
253621 
253681 
253741 
253801 
253861 
253921 
253981 
254041 
254101 
254161 
254221 
254281 
254341 
254401 
254461 
254521 
254581 
254641 
254701 
254761 
254821 
254881 
254941 
255001 
255061 
255121 
255181 
255241 
255301 
255361 
255421 
255481 
255541 
255601 
255661 
255721 
255781 
255841 
255901 
255961 
256021 



ttatttaaat 
aattctgaat 
aaaaacatgt 
ttccgacttt 
caggaaatac 
gagactaaaa 
aaggcaaagg 
ttaagaacat 
gtaactaaca 
caaagagaaa 
cgaaccaaag 
aaactgagag 
gagaagcaga 
acaacagcaa 
ccctgagcca 
tcaatcgcca 
cgttgactgt 
agccgttgct 
ctataaatag 
ttcgcttgga 
agataaaatg 
ccttaagctt 
acattgtgaa 
ccagcggaag 
gcttctccgt 
ttccaaagaa 
ggtcgttacc 
gaaaattatc 
ttatatataa 
cttcctcctt 
agctgatcct 
tgcaagcaag 
tacatagact 
gagcaagcaa 
caaaatcaac 
acgcgacggt 
ggctccacca 
cccacccatc 
cggctccgaa 
cccaccggaa 
gcatcactct 
tgcgatctcc 
gaaacaagga 
aatcacgctt 
agctcttcac 
actacacgag 
ttctcaacga 
tactactacc 
ttcttttgac 
gtttacacca 
cgctgagaca 
aagcgacgat 
tttcaactgt 
tccagggagc 
gacaccaatc 
gtgcagctct 
accactagga 
cctatgggat 
acgtggaata 
gttgcaagat 
agaacccctt 



cagttacaaa 
tttggtgtta 
aagcccacgc 
tttttttctc 
aaagctcaaa 
gcaaatacat 
atcgctggag 
ggaaagaccc 
attcctacaa 
ccaccaggac 
attgcggata 
agctttcctt 
acccaggagg 
gggcaatcac 
tagctccaaa 
caaacgagga 
ctaaacggag 
tcaccctaca 
aaaagagccc 
tctgggtttc 
tcatatgatt 
cagaatatgt 
actgtttagt 
atcgtcatga 
tctttatccc 
gctatagatt 
acaccgcctt 
gtttgttaag 
ggctatgcct 
attactttct 
tcttcctacg 
gttactagtt 
cctcaggacc 
atagacgagc 
gcactcgaag 
aaacaaacag 
gaccgttcca 
cat cat age c 
ctacgagaat 
ttgaaggttc 
taccttgcca 
gategtttea 
aacgatacca 
gcgccggatc 
gtgcgccaat 
ctctccctct 
tcaaactact 
gccaatccaa 
gagatgeagg 
ggtcaccacc 
gaatttgacg 
gatgacaaag 
atgcgcctcg 
acacacaact 
aaaccaatgt 
tttgcttgga 
tgcagtgatc 
ttcctaaacc 
tcacccaaca 
ccacagcttg 
gaaccggaaa 



aaaaaaaaca 
caagagaact 
atcgaatatt 
tgaacaatac 
agattcagaa 
aaacgctcag 
acaatcaaaa 
caacattgta 
gaactatcac 
agate cgect 
gaacccaggc 
aagatagcat 
tcacacccaa 
aagacatggg 
gggaaaccaa 
gctcgaacag 
geggagcegg 
aaaacagega 
agetceggtg 
tagagagacg 
gaacgacgtt 
gagagagaga 
ttgtgaaaaa 
tegtctttge 
tttcgaattc 
gtaaggaggg 
atgttttcct 
ataatgatcc 
cgatgatata 
gtgcaagcaa 
tgatcaactc 
tgggacttgt 
caatcatgga 
tacgttcttc 
cactcatcga 
aegcaagtte 
atccagagaa 
tatcegcaag 
ggacctaegg 
gtcttgeate 
atcgatacaa 
gtgagcttta 
tcgatgtgta 
aegcattgag 
tcaaggtcag 
cacatatgcc 
cccaacccaa 
acaacaaacc 
agegcaaacg 
ttaagcataa 
aegagattge 
ttccaactat 
tcaggaaata 
tcttggacat 
cagttgttgc 
agatgeaagg 
tcgtcctggg 
ttcgtatgga 
gctcaaaatt 
ctctactcca 
caattttctg 



aaatccaaac 
agacttttaa 
gggaagtggc 
tttaattaga 
actaagcagg 
teaaaageca 
caaagctcta 
tgacctggcg 
gaatcacaaa 
cacataacgc 
eggcaaagat 
aacegtaaaa 
acaccagcaa 
ttaaaccaaa 
atctgaaaac 
taacaaagga 
agegctgett 
cgaaacagag 
ctatggtagc 
tggagaaaga 
tccgactttt 
gaggtggaag 
gcttcctcga 
aaccgcgctc 
gtcccttttc 
agggatagct 
atttagtttt 
aaagcttttt 
tcaataataa 
tacaatatcc 
ctaggtctgt 
accccgttat 
gacaaggtaa 
gcaaaaccag 
aaagtacttc 
tgatattacg 
cagttccttt 
gctaacaaag 
ctgtgaacag 
tcttcatatg 
catcttttca 
cgacgatcca 
tctagataaa 
tatattcttg 
tacagtacct 
aacaaagaca 
taaaagccaa 
cctcattgcg 
taaaggatta 
gcgcgctgaa 
actagtggag 
ctccgtccac 
egagaaaegt 
ccagatggct 
agcaagtggc 
etaegggtte 
ggttcaatgg 
gttcaaattc 
aatttctgga 
tettcgegag 
tcatattgaa 



accatagata 
gactagttgg 
cacgtcatca 
actttaacct 
catataatat 
caacactccc 
aaagacagga 
aggecacaat 
caaggctaga 
taccggcaac 
cagcaacggc 
acacgatctt 
cgacaatacc 
caccgcgaaa 
teagaacgea 
agaagctgag 
cagacgaacg 
ctacactaga 
caccggagcc 
gagagaaaac 
ategtcttet 
caaataatca 
tttcaaactc 
tagtaatcca 
ctgatcccaa 
tgtgacagca 
atttcctatt 
ecaattegga 
gaataaagtt 
ttacaatctc 
gaatcccctt 
cctatctcat 
accacatcct 
caaactgaag 
gctaacgctc 
gatggaacac 
aaacctcacg 
attggctttc 
ttcttctcca 
aegggaaaag 
ttatggccag 
ttagcagagt 
tttgattgcg 
acaaacatga 
gaagccgcaa 
tcgcgccccc 
aaccacaact 
aatgctcctc 
tgcatgttct 
tttttgttct 
caaatcegtg 
gccctcaacg 
aaactacata 
aagggattag 
gacttggtta 
acagctgaga 
ctttccacct 
aacgagctaa 
agcagcttta 
attgacgaaa 
gcgagtgaaa 



gcatattgtg 
gcttagatta 
getactttge 
aaacttataa 
attatctcct 
ttcataaacc 
ccaccagaaa 
gatagttgtg 
actttagggg 
cgcaattgga 
aaagaggagg 
ccacgagcgg 
ateaagegat 
ataagecaaa 
aaaacacttc 
gactaaaaac 
aagcctagga 
aaaagacaaa 
agegacgaga 
tttggaatct 
tgcctccaag 
tccaattaca 
atagctctcg 
gattcgattt 
tgtgtaaatc 
caaatgtcgt 
tatcattatg 
ttttagcgtt 
tacaaatcat 
aggttcttag 
ccttctaacc 
tggtatcaga 
tgagegacat 
aacteggtag 
cacccccgca 
cgcaagctaa 
acaacaataa 
caatgtttga 
tcgacagcac 
cactacaatg 
aatatgttgc 
tggtaagctt 
ccatgaccag 
atcaacatct 
agatagecaa 
cattcaactc 
ccacctcccc 
aaaaatggct 
gtgaagaacc 
tggacttaga 
agacaaccat 
gtgcgccaac 
tactgattga 
gttgttcttt 
ctaagtacaa 
ttcgaacctt 
taggaccaat 
aacatgtctt 
acaaactgat 
ctacagagca 
cagagaacga 
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256081 caattccggt tcactcgaga gactccttga ttcatacacg gacgtatttg atgagccgtc 
256141 gaccttacct ccttatcgcg cgggtttcaa tcacaaaata ccattggaag caggatcaaa 
256201 tcctgtgaat ctccgaccat atcgatactc ttcgatacaa aaggattcaa tagacaagat 
256261 gattcaagac atgctttctc aaggtatcat tcagtacagt gcaagtccct acgcttcacc 
256321 tattgtactt gtgaaaaaga aagacgggtc ttggcggctt tgtgtcgact acagaggtct 
256381 caataagcaa acgatcaaag acaaataccc catcccacta ctcgaagatc tccttgatga 
256441 attgggcggc tcaaagtact tctctaaact ggatttacgt gcaggtttcc accagctttg 
256501 tatgtctcca gaagatgtgc acaagacagc ttttaaaaca cattcaggcc actacgaata 
256561 tttggtaatg ccattcggcc tcacaaacgc accttgcacg tttcaggggc ttatgaatca 
256621 cgtattcgca cccgttctac gaaagtttct ccttgttttc ttcgacgata tattaatcta 
256681 cagcaagacc tgggaggagc acctggatca cttggacaag gttcttgcta tacttcgcca 
256741 tcaacaactc tatctcaaga aatcaaagtg tacttttgga ggaacgagaa tcgaatacct 
256801 tggtcacttt atttcccatg atggcgttag tactgaccca accaagataa aggcagtcga 
256861 ggaatggcca caaccgaagc accagaaaca cctccgcagc ttcttaggtc tagccaatta 
256921 ctatcgaaga ttcatacaag ggtacagcat tatcgctcga ccccttacca tcatgcttcg 
256981 caaagacggt tttgcttgga atacagaggc ttcagacgcg ttccatctcc tcaaacaagc 
257041 ^ at taatct eg gccccggtcc ttgcactccc cgatttctcc aagactttca tcgtcgaaac 
257101 cgacgcttcc aacactggca taggcgcaat tcttatgcag gacaatcacc cagtgtgcta 
257161 cataagtcgt gcattagggc ctcgacacca aggcctttct gtttacgaga aggaactcct 
257221 tgctgtggtc cacgcagtac agacttggaa cccttacttg gctcacaaca agttcattat 
257281 tcggacagat cagaaaagtt tgaagttcct actagaacag agatcaccac tacattccaa 
257341 catatgtggc tatcaaaatt aatgggttat aactttgaga ttcagtataa agaaggcaag 
257401 gaaaacgtcg ctgcagatgc tttatctaga gtctccggtt cccaactact tcagctctct 
257461 ctctctctca tgctcaccac gggttctatg attctttgag gatgctctgg gatacatacc 
257521 ctactctgaa acagatcatt accgacctgc aaaacaaacc tgggtctcac gccgcgtact 
257581 cctttgtcaa cggagaactt cgccgccggg gtaaacttgt ggttggtaac gaccctgcta 
257641 tcaaacttca tatcttcaag tggcttcatg attctgctgt aggtggtcac tcgggtcgag 
257701 atgcaacact acaccgcatc aactcgttgt tcttttggcc aaagatgagc ttggaggttc 
257761 aaaactatgt ccgcaactgt ccagtttgcc agaaaaataa atatgattta gctgcgaaac 
257821 ctggactcct ctagccgtta ccggttccta cgggtgtatg ggagtctgtg agccttgact 
257881 ttatcgaggg tctaccacca tcatcaggaa aacattgcat tctcgtcgtt attgatcgtt 
257941 taagcaagaa tgctcacttt ctcgccttgt ctcatccgta cactgcaatg gacgtggcaa 
258001 agctatatat ggatcaggtc tttcgtcttc atgggatgcc taaagacatt acaagcgatc 
258061 gtgacccaac gtttctcagc gaggtttgga gagagatgtt ccgcgttcat gtgtcgattt 
258121 aaacttctcc accgcttacc atccacaaac tgatggtcaa accaaggtta ctaacaaaac 
258181 tttggaaacc tatctccgat gcatgacttc tgattcacct tctacatgga gcgcgtggtt 
258241 acccttagca gaatggtggt ataatactac ttaccacact gcgattcgca gttcaccgtt 
258301 cgagatcatc tatggccagc cacctccagt gcacttgcct taccttcccg gcgaaagtac 
258361 ttctaccacg gttgacaggt ccttacaacg aagggaagag ctcattgaca tgatgaagtt 
258421 ccacctcttg cgagctcaga acaggatgaa acaatacgca gactcgcacc gttctgaacg 
258481 tgcgtttcat attggtgact atgtctattt gaagctccag ccgtatcgac aacattccct 
258541 caaaggacga catttgcctc acaagctttc accacgtttt tatggtccgt acgagataca 
258601 agatcgtgtt ggtaacttgg catataagct acgtcttcca tttgaagcag ctatccataa 
258661 tgtcttccat gttagtcagc taaagctcgg ccccaacccg cctgcaactc cttcgtcgct 
258721 tccccaatac ctcaaagatg tcggcactgc aaaggaacca gaaaagattc ttgaaacaaa 
258781 aatggtgaat cgccgaaaca gagctgtcac gaaggtacta gtacagtgga aaggctactc 
258841 tccggaacaa gccacatggg agttctatca agacttcgtc gcgaaacatc ctgatttcaa 
258901 tacttgaggc caagtatcat ctgaaggagg aagtattgtg acagcacaaa tgtcatggtc 
258961 gttaccacac cgccttatgt tttcctattt agttttattt cctatttatc attatggaaa 
259021 attatcgttt gttaagataa tgatccaaag ctttttccaa ttcggatttt agcgttttat 
259081 atataaggct atgcctcgat gatatatcaa taataagaat caagtttaca aatcatcttc 
259141 ctctttatta ctttctgtgc aagcaataca atatccttac gatctcaggt tcttagagct 
259201 gatccttctt cctacgtgat caactcctag gtctgtgaat ccccttcctt ctaacctgca 
259261 agcaaggtta ctagtttggg acttgtaccc cgttatccta tctcatagct ttatgaccta 
259321 cattctgttc tcggaagtta aggtcatgta tgtttcatgc catcagaaga attgtttttt 
259381 ttcctgaaag ttccatcaag aataattgtt gttaccctct taattgaaat ggataaattg 
259441 attttggtca catgagcatg atgaacataa acaatcacat aaactagaac ttagaaaatg 
259501 cagaggttaa gaagttgatt ctggtaacat gatcatgaat ctattgacga acataaataa 
259561 aaacacttat ggatgccaaa caagcaaact cttgattata agaggtgtct agaagattgg 
259621 aggagtgaag gccttctcta tgaatcatat tagcatgagc aatcggccta gcttcatttt 
259681 ggtgtctgac tcacgccatt cccatcagcg gttcaaagtc atcttggcga aaaacctttt 
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259741 gatgagatat agaaaatcag tttaattaaa 
259801 aatgttaaaa aagtaaagat gttgccatga 
259861 aaagagtctt ttaaaatttg acaacagtta 
259921 aggtataaac atcataaacc tgattcaggt 
259981 ggtaatgaaa caaaatatta atggttcatt 
260041 aagacttgtg tgatccttag tccaaaaaaa 
260101 gacaaacaaa aaaaacaaag acttttgcgt 
260161 cccacgcaac cgaatattgg gaagtggaca 
260221 tttatcgtct tctgccccca aacctcgaga 
260281 atcatcgaat tacaacattc tgaaattctc 
260341 actgctctct cagtgtctaa taatccttgg 
260401 tcgtcagcgt cacgatcgtc tctttgcaat 
260461 ttgctaccat tctctatccc ttttcctgat 
260521 aactccaaac attaggggaa gatgattcat 
260581 gccgctcttt atcttctcgg ccggagtagt 
260641 actagggtta accaactcaa ggagctaggt 
260701 ctctctcata tatattattt ttgtctgatt 
260761 tgtattttgc agcgcaattg ctagaattag 
260821 ccgtttcagg aagagttggc tctgacactc 
260881 gcgttatcgt cgaggaaacg gtacttatta 
260941 aggtgtatct atgcaaatca tatatagttg 
261001 tgaaacacaa tgagactggt tcttgggtac 
261061 aggaggttcc ttggttcctg gtatgtcaaa 
261121 ttttttgaaa aacaatctct cgaatttcga 
261181 gtcgtgtgaa tgtagtggga gctcgtggtg 
261241 aagtttttga agagtcaggg cggtctcttg 
261301 ttaaggtatt tttgcttttc tattcggttg 
261361 ctttaagcgg gtatcgtttt ggtttcagat 
261421 tcctactgga atgcctctca caattgttgg 
261481 ttttgtttaa aatttcttaa gaatagtatt 
261541 ggcagactga tgagtttgat ttaatttctg 
261601 cgatattggg gacctaagga ttcagaaacc 
261661 atcactcgat cagctcattt ctaatctggg 
261721 ctctcacttg gttcttcttc tcctatgcta 
261781 ttttacaggt tgtacaagta tgcctccatg 
261841 acaaagcatg tcattgattt tcttctagag 
261901 tatgttaaac atttgtctgt ctataaagcg 
261961 ttccctcgga aattatatac acaaaaatat 
262021 ttgatgcagc agctaagaga gctgggactg 
262081 ttccactcat aaattggtag cacagtttct 
262141 aacggcgcac atgagagcgt atcagattct 
262201 tgtgtgatct gccttgagca gaactacaat 
262261 ttcattttgg ctgtttttta catttgcaaa 
262321 tctcgcaggt gtggtcatat gtgctgctgc 
262381 ccactttgtc ggagacgtat agatcaggtg 
262441 aactcagggc tcagaagcat tctctactcg 
262501 gatactaaat tggtgttcgt tatcctctct 
262561 tatataaaaa cataaggtgt ttgcagtaag 
262621 gacattaggt gtttgcagct taacctgttt 
262681 atatctgtct ttttccccaa aacaagcttt 
262741 agcagagaag ccagctccaa caatctgtac 
262801 tcgtgtagat tattccttca gtaatattag 
262861 ggcttgatca aacccaagcc ttcttatctc 
262921 ttcaacaatg ttgagttcgt ggtaggtact 
262981 ttcaagccat ccatgttctt tagcatataa 
263041 ttcctgtcag tgaagcgacc ctagtttatg 
263101 tcaatctgtg tggtttaggg gcaaccactg 
263161 atctagtaca taaaatagtt tatatatatg 
263221 ttactatgtc aaatgtgaat catgtcaatc 
263281 aacggttcaa tattctatat aattttcaat 
263341 attctatgtt ttgttatcag tatcattggc 
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cacaaaccaa atttgaatga cttctccagc 
catacgcagg cgtatgttga caaaccgacc 
aaaacggatt ctacagctaa tatatgcatt 
ttatgattca gctaatatat gcagcaatcc 
agttagaaaa agattatcac gtgattaaca 
aaaaaaaaca aagacttatg cgacgacgac 
ctagttgggc ttaaataata aacatatagg 
cgtcattagc taccttgctt ccttccgacc 
gagagagaat agagatcgaa gcaaagaatc 
accttcttcc gatccacaga ttattccttc 
aacgcttcct cgatgtcaaa ctcatagctc 
cgcgctctta gtaatccatc ccagattcga 
cccagtattt aattttcatc cgcggctgaa 
tggggtggag taacctgctg cctcagcgcc 
ggcagggacg ctgaagtact caaaaccgtc 
catcttctcc cccaaattgc ttcatctact 
cttattaata cttgtttttc ggatttacta 
atagcagcaa gctccttcct ttcatcgtag 
ctatcaagtg cgagcatagt ggcatacgcg 
atctagtttg tgtttcccgt tttgtcagtt 
atgtgtttca tcaggcggaa caacattttc 
aagatagtgc acttatgcta tctatgagca 
atatcaagtt tgcttactgt ttggatgttt 
aattgtttgt gaaacaggac gatgggacaa 
caacaggttt tgctttgacc gtcggaagtg 
tacggggaac acttgattat ctccaaggcc 
cttgtttgtt gttattgacc tctctatttg 
gcttggagtt aagcgcattg agcgtgttct 
tgaggtatgt cgtattctca gtgtcttctt 
tgtagcaagc tggattgatc tttctgcaga 
ttgcttgcat gtattgcagg ctgtcaagga 
tgaaagaggg cctttctacg tctctcctaa 
gaaatggtca aggtcctctc tctctctcct 
tttcagaaaa agacgatgag ccaaactgtt 
ggtttaactg ttttcggtgt gtttctaatt 
agaagacagc ggcgagaaat acagaaaagg 
aataattgtc ctcaactaga tcagttttgc 
taagcagtgt cagctgttgt tgcagagtgc 
aaggtatcca ttgggtgaaa tctttttatt 
gaccacaaaa cttttctctc accaggttca 
accaagaatg aaggcgctgt tcctgatctc 
gctgtgtttg tcccgtaagc attcttctgc 
ttgctaaaaa gagcgtgtgt tgttgttact 
accgcatgct cctcccattt gtccagctgt 
gttaagacat atcgtcactg agcaaaactc 
agttgcctgt aaatactgca agatcaaaac 
tgtttgcctt caagaacaat acatttgaac 
ttcgatttaa ttaatcttgc cgctgccaat 
gaatcttcct tgcttcactc ttgattgact 
aagtgtttct cacaaaatgg gagatactaa 
gattctcagc tttgccattg ttgtcttctt 
tgttgagaga agagattgag tcgctttgtt 
tttttccacc gtaaacacat gggcaccaca 
gatgaattcg aacatcatct cattctcctg 
atagatgtcc cgcttggatg catagacatg 
cgcacgcgac acaaaaatct gctagctact 
aaccacgatg caaaacctaa attctctgga 
ctatgccggt ttttgtgtta caaaaagaaa 
gtataacttc caaatcatgg ctggatttta 
caacaaattt gtttctttta tcaccttcat 
atctcatcca tctttcattt gtttcaggtt 
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263401 atcaagttgg atatcggcca tatcgaccca tcgatcacat ctaagggcgg tcggttagaa 
263461 cttagaacct cggaaaccaa tctggtcgac ccagattctg agacagagat aatcagtatg 
263521 acagatagat cctaactagg agctaaccaa tagctctaag aaaggtgaat actagtctag 
263581 aaggggactg tgagttttag aatagtcacc tcgaagatgc tagatcctta ggatataaga 
263641 acagagagtt ttagaggttg gataaaatga tttctgatag atctctgcaa atgcaggaga 
263701 tgagtacaaa tactaaagtt cgagggtcac actaacggtc taaaacaatg acctgttaat 
263761 gacgacaaaa gcataaagta aaagcataaa gcaaaaggtg aaatgagctg gatgtatttg 
263821 ctgactacgc ctttttgtga tcaggttgtt aggcgtgata ctcctgtgac aatactccct 
263881 tcttcagagc aaccttgtcc tcaaggattg aaaagaggat acttgttggt gaagtccttg 
263941 tagaactccc aagttgccat ttcaggtgat tcatctttcc agtggactaa aacttttgtt 
264001 gcagcttcgt tttggcgctt aacagtctta gtctccagaa ttgctgcagg ttctttggtg 
264061 ttgccgaggt caagccagta ctgaggaacc gcaggggagg atgaagcagg gtttgggcac 
264121 agctttagtt gactcacgtg aaatgtgtca tgaattgcag cttcagtagg cagcgagagt 
264181 ttgtaggcaa ccttgccgat acggtctagc acacggaagg gaccataaaa gcgtggcgag 
264241 agcttgtgtg gagccttatt ctttttcact gtatgctggc gatatggctg gagcttcaag 
264301 tatacgtaat caccgatctt gaactctcgc tgggaacgct ttgcatcggc atactgccgc 
264361 atacggttct gagcgcgaag caagtgaaat ttaagcatgt taatcacttc ctcacgcttc 
264421 tggaggctgc ggtctacaac aacggaggag ctttcacccg gaaggtaagg gaggtgaagt 
264481 ggtgggggtt gaccgtagat gacctcgtat ggagtagctt gaatggcaga gtgaaaagtg 
264541 gtgttatacc accattctgc aagactgagc catttgctcc aggatgttgg tgtctcagca 
264601 gccatacagc gtagataggt ctccaaggtc ttgttggtta cctctgtttg accgtcggtt 
264661 tgagggtggt aagcagtgga gcgcttgaga gtaacccctt gtacacggaa gagctcgttc 
264721 caaacctcac tgataaagat cggatctctg tcactgatga catctttggg catgccatgg 
264781 agtttgaaga tgttgtccaa gaatgcctga gcaacagtga gggccgtgta tgggtgggac 
264841 aagggtataa agtgtgcgtt cttgctcatc ctatcgacca ctaccataat acagtgtttg 
264901 ccctgagacg gcgggagacc ttcaatgaag tctaggctta tagattccca gatcccatca 
264961 ggaattggta gaggttgtag aagacccggt ttggcagcca tatcgtactt gttctgttgg 
265021 cagatactac agtttcgaat gtagttctgg acctccacgt ttagcttagg ccaatagaac 
265081 aaagatctga tacgatgtaa ggttgcgtca cgtccagagt gtcccccaat ggcagagtcg 
265141 tgaagccatt tgaagatgtg caacttgata tctttatcgt ttccaacgac tagcttgcct 
265201 cgacgtctca gctctccgtt tgtgtatgtg aagagggggt gtgaagagct gttagatttc 
265261 aagtcgcaga taatcttctg gagagtagca tcggtttccc agagtagctt caacgaatcg 
265321 tagaaaccag tatgagcctg agatagtacc atgttcagta actgagagcc agagactcga 
265381 gagagagcgt cagcaacaac attgtctttc ccttgcttgt agtggatctc gaagttgtag 
265441 cccataagct tagagagcca catgtgctgg aaaggtgtag ttattttctg ctccatgagg 
265501 tacttgaggc ttcgctgatc agtgttgatg atgaaagggc gatgcgcaag gtaggggtgc 
265561 caggtttgca ctgcgtgcac taatgccatc aactcctttt catacacaga caggttttgg 
265621 tggcggggac caagggagcg acttatgaag cagatagggt gttttccctg cataaggacc 
265681 gcaccaatac cggtgtttga cgcgtctgtc tcaacaatga aaggttgatc aaagtcgggg 
265741 agggctaaga caggcgtgga actgagcgct gctttgaggt ctgagaagga ttgatcagct 
265801 tcaagggacc aagtaaagcc atcttttttc agtagcgtgc tgagggggcg agcgatggag 
265861 ctataacctt tgatgaagcg gcggtaatag tttgccaatc ccaagaagct ccgtaggtgc 
265921 ttctgtgtcg tagggattgg ccaatcacgt attacagcaa tctttcgggg atcggtgctc 
265981 acgccgtcag cagaaataaa gtggccaagg tactctatga tcgttgcacc aatagtacat 
266041 ttagagagtt tgagatacag ctgttgttgt tgcaggatca ggaagacctc cgcaaggtgt 
266101 tgtaggtgtt cctcccaagt cttgctgtat acaaggatgt cgtcgaaaaa gacaagcact 
266161 gatttccgag agaggtcttg aaatacatga ttcataagac tctggaaagt gcaaggggcg 
266221 tttgtcaagc cgaaaggcat gacaaggtat tcatagtgcc cttgatgtgt cttgaatgct 
266281 gtcttgtaga cgtcatcctc cgacattcgg agttggtgaa aacctgctcg aagatccaac 
266341 ttagagaagt acttggcgcc accaagttca tcgaggaggt cttctagcag agggatcaga 
266401 tacttgtcct tgatcgtctg cttgttgagc cctctgtaat ccacacaaag acgccacgtg 
266461 ccatcttttt tcttgaccaa gacgatgggt gaggcgtagg ggctagcact gtgctgaatg 
266521 atcccttgag tcagcatttc cttgatcatc gtatcgattg tatctttttg aagagaagag 
266581 taccgatacg ggcgtaagtt gacggggttg gagcccgcaa ggagtggtat ctggtgatca 
266641 aaaccctctc taaacggtgg cagtgactta ggttcttgaa agatatcttc aaacgtttga 
266701 agaagttgtt ggagagctgg gtcgtcagca tttgttgttc ccgaagcaga gatgtgagag 
266761 tacaacattt caggattgag gctttgctgg ttgtcggcat tctcatcaat ctgttgtagc 
266821 tggattaagg ctatttgagg ttcttggagc atgagcttgt tcaagctgga gcccttaata 
266881 accttgcctc ctgtcttaac gacgccacga agtacatgtt tggtgccaga gagtgtgaac 
266941 tccattctga ggttgaggaa gtcccaaaga ataggtccca aagtgcaaag ccattgcact 
267001 ccaagcacaa aatcgcagca gtcaagaggg acagtgcgga tttccgtcgt aaaggaggag 
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267061 ccttgaacgg tccaagtgaa agccgagcac ttgaagtttg tgagaagggt gtcaccagtt 
267121 gcagctttca ctgacattgg gcgcgttggt tcgagctcac aacccagttg agtggcgatg 
267181 ttgaggtcca caaagttgtg ggtacttcca gggtctacaa gtatatgcag cttgtgtttg 
267241 ccgtgatgac caatcaagcg catacagttg taggaggtgg agccgtttaa ggcattgatg 
267301 gagatgactg gtgtgacttc aacagtcgct tgcttgtcgt ctgactctgg ctcggcatca 
267361 gaggagctat cgtcagggat ggtatcagca tcatcacact ccatgacata gatctgagag 
267421 cgcttgtgtt ttagttgatg gccaggagta aatggttcat cacaaaacat acataagcct 
267481 ttcgatcttc tgtcttgcat ttcctggtag gagaactttc gaggtggttt gtcagtgggg 
267541 tttcggggaa taaaggaggg tttttggttg gtgggggctt gggttagagg taggaggggg 
267601 gaagggttgt ttggtttttg gtagggtttt gtgtttgggt aaggattgaa gggggctctt 
267661 tgtcggtttg gggtatggga gagagaggat tcatggagca tagcaatctt tgcagcgccc 
267721 gcaatggagg ttgtttcaaa ctgtctcgtg tggagagaga gatgagggtt catgttagcc 
267781 aggaaaatgc taagagcatg agcctcagga aggacaaggc gcatccttgt cgtctcaaac 
267841 ttgtcgagat aggcagcaac caaatcggag ccttgcttga gggccactag ctcagcaaga 
267901 gggtcgtcaa agagttcaca aaaacgagca gatatggcga ttatatagtc tgtccaagag 
267961 gggaacatgc caaacctagt gctcatgtag tttgtgtgcc attgagtagc tcttccggtg 
268021 agatgcatcg cagctaagcg taccttcagc tcctgtgagg tgccgtcgat atcaaagaac 
268081 tgctcacact tggaaagcca gtcgcggagg tcagtgccat cgaactttgg gaacgagatt 
268141 ttggaaaggc gagatgtgag tctgccaaac tcgttgtgtt gacggaatcc gttgagatct 
268201 ggggggtcag gtggttggga agaaccgagg ttgaagggtg tcggagggtg ggaggggccg 
268261 gggtctagtg gggcttttcc ggtggcgtgg agcggtgagg cgttggcgga gaacatcagg 
268321 gcctcgagtc tgtcgaaacg agcgtcgagg gagtcggaac gagacttaag ctcagcagcg 
268381 aggaggtcgt gtaaggaccg taactcgtca acttgctcag tcagagatct ctcttggagt 
268441 cgtgtttcaa ccatggtttc ggtaccgagg aaaggtgact tctgatacca atgagacaga 
268501 gataatcagt atgacagata gatcctaact aggagctaac caatagctct aagaaaggtg 
268561 aatactagtc tagaagggga ctgtgagttt tagaatagtc acctcgaaga tgctagatce 
268621 ttaggatata agaacagaga gttttagagg ttggataaaa tgatttctga tagatctctg 
268681 caaatgcagg agatgagtac aaatactaaa gttcgagggt cacactaacg gtctaaaaca 
268741 atgacctgtt aatgacgaca aaagcataaa gtaaaagcat aaagcaaaag gtgaaatgag 
268801 ctggatgtat ttgctgacta cgcctttttg tgatcaggtt gttaggcgtg atactcctgt 
268861 gacagattcc agcctgtatt acttcgggat cttttttttc tcaaattcct ctatcgtccg 
268921 ttgagattca cttcttcggc ggcactgagc tcgaaattgt cgtctgcttt ggcggtactg 
268981 ttcactcaat tcgtattcgt cctcactcca ttcgcgctcc tcactcgatt catacggtgt 
269041 cgcctgtaaa ggattcgctg ggttcaactc aaggctctcg tcgatcagac tattaatatc 
269101 gtcttcacat gattcgtacc tcccatcagt gaggtaatca atgtggtctt cagacacttg 
269161 gtctgatcca gattcatagc tcccttggat gctcttgtct tcaaacactt tggcaccaca 
269221 ttcaataacg tcgaattcgt gagatttgct gctgaattcg aacactgtct ctctgctgac 
269281 ttcggagggg acgatgaatg gcaagcgagg gtgaaatata agtagatgtt cagatcgata 
269341 tctggagaca tagcttacgc aaaactcctc aacaacgggg tctaggtcgc ctttgcttat 
269401 gcagcggcac actaagtatt ctgaatgtct actttttttg ctgatttgat ggttaggtga 
269461 caccacaacg caaagcttca atatagatac agcagaaaga ggattgttac catagggaat 
269521 tctaatggtc aatgaacttc ctctggctcg gtgatcaaac ttggcaggta ctttttttcc 
269581 tggtaagata acccattcac aagacaagga acggtggata atttctcttc gtgcttgttg 
269641 gcctaatttg aagcagttgg tgaaatcgag tactgcattt ggactgctat tcaaagggca 
269701 aaacacggtc tccagtgatt cgcaatcttc cgcatttaag gatttgatcg aaccagggag 
269761 ctctggcaat gatgcgagtc ttctacatcc atcaagggag agttgctcta gctgatgaag 
269821 atctttgatg caatctggaa tcgtctcaat atcagaatac cgtaggtcta gacatattag 
269881 actcttgtgg agatgtgtta atcccttgag tttcccactt ttccttatac gagagtacat 
269941 accagatgat ggcggccgtt gtcgaaaaga gtccccacat atactcaatc tcggtgaagt 
270001 agttgaaata aacagaaact ttctcaattt tgaacatccg ttcatttcga caattccaag 
270061 agaagccaag ttcaggtggt ctggaatgac ttgtaggttt acgcaatcat ccatccacca 
270121 caattctagt ttctgaagat gcataaaaga ggatggaatc tctaccaaac tcttgcaata 
270181 actcatatcc aatatcttca gatttgttgc tcttgaaaga tccgggagtt ccttcaactg 
270241 gcatgaaaat gacagatcaa gcttcttgag atttgcaagc cgctgaattt gagaaccaca 
270301 caacaacaga gggacatcat ataacatgaa gagaagataa ttcatatata tatatatata 
270361 tatattaagt acatacacca acctgggttc ctttccagag gtgctcgaat ttgccatgct 
270421 tcatatcaag ttccacgaga tattcaggat gaaaggaagg aggaagacac ttgttagggt 
270481 atgcattcca atgtagaaac cttagacggc gtggaaactt catctcctca ggtatatgta 
270541 ttacatcatt catatcatct ctgctttttt ccccagcaag aatgtcaaat tttagtttgt 
270601 catctctgct tttgtagact ttgagaaact gaagattata cattctttta aaagctcttt 
270661 cgtcaacaat caggtcgttg attcctgata tatcaaatga tatgccggac acagctctag 
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270721 ttccctaaac caaattacag ataaaaaata aaaggttagg tccctatgtc agattttgcc 
270781 tagaatgact ggtctgatat aaaaataatt accaaggaat caaacagata tatatatata 
270841 tataaaaaag tggaaaagaa tttactgttt cgttttcgag gacatcgcaa atctcatgcg 
270901 catccagtaa gatttggcgt ttccaaggct cttgtttatg aatggcttgt ctacccactc 
270961 gtcgtagtag cttgtgcatc tcgattatat tcccatatgt agatatctct atgagagatc 
271021 tatagactag gatctgcaaa ccgcgtttga catctattgt accgtcagag aacatggctt 
271081 tcacaagatc actatcttca ttgttgaaga agactgcaat gtggagaaat agggtttgct 
271141 cattctcctc taaactttca tagcccactt ttagcacgtc ctctatacct tgatcaagaa 
271201 tagtttccag tctgttcact acatcttccc attcgtcctt cttcttccca cgtaaagatt 
271261 tacccaccac acatagagcc aatggaagat taccacaaag ccttattatc cttgtacaaa 
271321 gcacctcaaa accataaagt ggagacctta gtctaaaagc acatctacat aagatctcga 
271381 tagcttcttc atcaaatgga aaccccacat ggtaagtgct accgatacca tgttgctgca 
271441 aaatctctag gttttctgtg gtcactacaa tcctactccc aggaccaaac catgcactct 
271501 cattagccaa cgcctctaat tgctttagac tgttcacgtc atcaagaatg ataagcactt 
271561 tctggtcgca taggttttct tttactgcac ctaaatggca tatcctcatg tccttgtggt 
271621 tcaaaacctt tgaaagaaaa ttttcttgta accttttctt ccaaccatat tcgtcaagac 
271681 cactaggaca gatttcccta aggttgtcca caaaacaagt aagctggaac ctacgggaga 
271741 gtagactatg taaagctcga gctatggtag tcttaccgat ccctgcagga ccagtgatgg 
271801 caaccatctt aactccatcg taatccagat ctaacaaaga ctctatttcc tttagatgag 
271861 catcaagtcc cacaatgcct tcaaaatctc tacacggtgt tgcattcagt ttttgtaaaa 
271921 catcttttgc aatcttcgca atcattttag cttcattgac ccttagataa aaaaaaaaaa 
271981 acaatagaga 
(SEQ ID NO: 87) 

Gene 26 Product. 

^lLARVCGFKCSSSPAESAARLFCTRSIIUy^LAKASGESCEAGFGGESLKL 
Q SGFHE I KGLEDAI DLF SD^RS RPLPS VVDFCKLMGVVVRMERPDLVI S 
LYQKMERKQIRCDIYSFNILIKCFCSCSKLPFALSTFGKITKLGLHPDW 
TFTTLLHGLCVEDRVSEALDFFHQMFETTCRPNVVTFTTLMNGLCREGRI 
VEAVALLDRMMEIXSLQPTQITYGTIVDGMCKKGDTVSAIjNLLRKMEEVSH 
1 1 FNWT YSAI IDSLCKDGRHSDAQNLFTEMQEKGI FPDLFTYNSMI VGF 
CSSGRWSDAEQLLQEMLERKI SPDVVTYWALINAFVKEGKFFEAEELYDE 
MLPRGI I PNTIT YSSMIDGFCKQNRLDAAEHMFYLMATKGCS PNLITFNT 
LIDGYCGAKRIDDGMELLHEOTETGLVADTTTYNTLIHGFYLVGDLNAAL 
DLLQEMI SSGLCPDIVTCDTLLDGLCDNGKLKDALEMFKVMQKSKKDLDA 
SHPFNGVEPDVQTYNILISGLINEGKFIjEAEELYEEMPHRGIVPDTITYS 
SMI DGLCKQS RLDEATQMFDSMGS KSFS PNWTFTTLINGYCKAGRVDDG 
LELFCEMGRRGIVANAITYITLICGFRKVGNINGALDIFQEMISSGVYPD 
TIT I RNMLTGLWSKEELKRAVAMLEKLQMSMVCKFLFSLC I FYINKNVYI 
LLCVASD 
(SEQ ID NO:88) 

Gene 26 Product 

atgttggctagggtttgtggattcaagtgttcttcttctcctgctgagtc 
tgcggctagattgttctgtacgagatcgattcgtgatactctggccaagg 
caagcggagagagttgcgaagcaggttttggaggagagagtttgaagctg 
caaagtgggtttcatgaaatcaaaggtttagaggatgcgattgatttgtt 
cagtgacatgcttcgatctcgtcctttaccttctgtggttgatttctgta 
aattgatgggtgtggtggtgagaatggaacgcccggatcttgtgatttct 
ctctatcagaagatggaaaggaaacagattcgatgtgatatatacagctt 
caatattctgataaaatgtttctgcagctgctctaagctcccctttgctt 
tgtctacatttggtaagatcaccaagcttggactccaccctgatgttgtt 
accttcaccaccctgctccatggattatgtgtggaagatagggtttctga 
agccttggatttttttcatcaaatgtttgaaacgacatgtaggcccaatg 
tcgtaaccttcaccactttgatgaacggtctttgccgcgagggtagaatt 
gtcgaagccgtagctctgcttgatcggatgatggaagatggtctccagcc 
tacccagattacttatggaacaatcgtagatgggatgtgtaagaagggag 
atactgtgtctgcactgaatctgctgaggaagatggaggaggtgagccac 
atcatacccaatgttgtaatctatagtgcaatcattgatagcctttgtaa 
agacggacgtcatagcgatgcacaaaatcttttcactgaaatgcaagaga 
aaggaatctttcccgatttatttacctacaacagtatgatagttggtttt 
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t gt agct ctggt aga tggagcgacgcggagcagt tgt tgcaagaaat gt t 

agaaaggaagatcagccctgatgttgtaacttataatgctttgatcaatg 

catttgtcaaggaaggcaagttctttgaggctgaagaattatacgatgag 

atgcttccaaggggtataatccctaatacaatcacatatagttcaatgat 

cgatggattttgcaaacagaatcgtcttgatgctgctgagcacatgtttt 

atttgatggctaccaagggctgctctcccaacctaatcactttcaatact 

ctcatagacggatattgtggggctaagaggatagatgatggaatggaact 

tctccatgagatgactgaaacaggattagttgctgacacaactacttaca 

acactcttattcacgggttctatctggtgggcgatcttaatgctgctcta 

gaccttttacaagagatgatctctagtggtttgtgccctgatatcgttac 

ttgtgacactttgctggatggtctctgcgataatgggaaactaaaagatg 

cattggaaatgtttaaggttatgcagaagagtaagaaggatcttgatgct 

agtcaccccttcaatggtgtggaacctgatgttcaaacttacaatatatt 

gatcagcggcttgatcaatgaagggaagtttttagaggccgaggaattat 

acgaggagatgccccacaggggtatagtcccagatactatcacctatagc 

tcaatgatcgatggattatgcaagcagagccgcctagatgaggctacaca 

aatgtttgattcgatgggtagcaagagcttctctccaaacgtagtgacct 

ttactacactcattaatggctactgtaaggcaggaagggttgatgatggg 

ctggagcttttctgcgagatgggtcgaagagggatagttgctaacgcaat 

tacttacatcactttgatttgtggttttcgtaaagtgggtaatattaatg 

gggctctagacattttccaggagatgatttcaagtggtgtgtatcctgat 

accattaccatccgcaatatgctgactggtttatggagtaaagaggaact 

aaaaagggcagtggcaatgcttgagaaactgcagatgagtatggtatgta 

agtttctgttcagtctatgtattttttatataaacaagaatgtatacatt 

cttttgtgtgtagcttcagattga 

(SEQ ID NO: 89) 
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1. An isolated nuclear fertility restorer nucleic acid selected from the group consisting 
of: 

a) a nucleic acid comprising a gene within positions 88,073 and 198,041 of the 
nucleotide sequence of SEQ ID NO:87; 

b) a nucleic acid encoding a protein comprising an amino acid sequence encoded 
by a gene within positions 88,073 and 198,041 of the nucleotide sequence of SEQ ID 
NO:87; 

c) a nucleic acid comprising a nucleotide sequence differing from the sequence 
of the nucleic acids of a) or b) due to the degeneracy of the genetic code; 

d) a nucleic acid fragment of any of a) through c) above at least 150 nucleotides 
in length; and 

e) a nucleic acid having at least 70% homology with a gene within positions 
88,073 and 198,041 of the nucleotide sequence of SEQ ID NO:87, wherein sequence 
homology is determined by the Karlin and Altschul algorithm using standard 
parameters. 

2. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID 
NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. 

3. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence of SEQ ID NO:32. 

4. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence of SEQ ID NO:89. 

5. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from SEQ ID NO:29, SEQ E> NO:31, SEQ ID NO:33, 
SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. 

6. The nucleic acid of Claim 1 , wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:31. 
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7. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:89. 

8. The nucleic acid of Claim 1, wherein the nucleic acid comprises a nucleotide 
sequence differing from the sequence of the nucleic acids of a) or b) of Claim 1 due to 
the degeneracy of the genetic code. 

9. The nucleic acid of Claim 1, wherein the nucleic acid is at least 150 nucleotides in 
length and is a fragment of any of the nucleic acids of a) through c) of Claim 1 . 

10. The nucleic acid of Claim 9, wherein the nucleic acid encodes a protein having a 
pentatricopeptide motif. 

1 1 . The nucleic acid of Claim 1, wherein the nucleic acid has at least 70% homology with 
the nucleotide sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID 
NO:89 and wherein sequence homology is determined by the Karlin and Altschul 
algorithm using standard parameters. 

12. The nucleic acid of Claim 1, wherein the nucleic acid encodes a protein having a 
pentatricopeptide motif. 

13. A vector comprising a nucleic acid of Claim 1. 

14. A first isolated nucleic acid that hybridizes under highly stringent conditions to a 
second nucleic acid selected from the group consisting of: 

a) a second nucleic acid comprising a nucleotide sequence selected from SEQ ID 
NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID 
NO:52, SEQ ID NO:54 and SEQ ID NO:89; and 

b) a second nucleic acid encoding a protein comprising an amino acid sequence 
selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, SEQ ID NO:41, SEQ 
ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88, 

wherein the first nucleic acid encodes a nuclear fertility restorer protein. 
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15. A transgenic plant cell comprising a nucleic acid of Claim 1 . 



16. A transgenic plant comprising a plant cell of Claim 15. 

17. The transgenic plant of Claim 16, wherein the plant is a hybrid. 

18. The transgenic plant of Claim 16 or 17, wherein the plant is a Brassica napus plant. 

19. A seed produced by the plant of Claim 18. 

20. The seed of Claim 19, wherein the nucleic acid comprises a nucleotide sequence 
selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ 
ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. 

21. The seed of Claim 19, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:32. 

22. The seed of Claim 19, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:89. 

23. The seed of Claim 19, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from an SEQ ID NO:29, SEQ ID NO:31, SEQ ID 
NO:33, SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID 
NO:88. 

24. The seed of Claim 19, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:3 1 . 

25. The seed of Claim 19, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:88. 

26. A method of producing a hybrid plant, comprising crossing a male-sterile plant with a 
male-fertile plant, collecting hybrid seed from the male-sterile plant and regenerating 
the hybrid plant from the seed, wherein the male-fertile plant comprises a nuclear 
fertility restorer nucleic acid selected from the group consisting of: 
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a) a nucleic acid comprising a gene within positions 88,073 and 198,041 of the 
nucleotide sequence of SEQ ID NO:87; 

b) a nucleic acid encoding a protein comprising an amino acid sequence encoded 
by a gene within positions 88,073 and 198,041 of the nucleotide sequence of SEQ ID 
NO:87; 

c) a nucleic acid comprising a nucleotide sequence differing from the sequence 
of the nucleic acids of a) or b) due to the degeneracy of the genetic code; 

d) a nucleic acid fragment of any of a) through c) above at least 150 nucleotides 
in length; and 

e) a nucleic acid having at least 70% homology with a gene within positions 
88,073 and 198,041 of the nucleotide sequence of SEQ ID NO:87, wherein sequence 
homology is determined by the Karlin and Altschul algorithm using standard 
parameters. 

27. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence 
selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ 
ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. 

28. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:32. 

29. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:89. 

30. The method of Claim 26, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, 
SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. 

31. The method of Claim 26, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:3 1 . 

32. The method of Claim 26, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:89. 
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33. The method of Claim 26, wherein the nucleic acid comprises a nucleotide sequence 
differing from the sequence of the nucleic acids of a) or b) of Claim 1 due to the 
degeneracy of the genetic code. 

34. The method of Claim 26, wherein the nucleic acid is at least 150 nucleotides in length 
and is a fragment of any of the nucleic acids of a) througji c) of Claim 1. 

35. The method of Claim 26, wherein the nucleic acid has at least 70% homology with 
the nucleotide sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID 
NO:34, SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID 
NO:89 and wherein sequence homology is determined by the Karlin and Altschul 
algorithm using standard parameters. 

36. The method of Claim 26, wherein the hybrid plant is a Brassica napus plant. 

37. The method of Claim 26, wherein the male-sterile plant comprises a cytoplasmic male 
sterility determinant selected from the group consisting of pol, nap, Tournefortii, 
Kosena and ogu. 

38. The method of Claim 37, wherein the cytoplasmic male sterility determinant is ogu. 

39. A method of increasing production of viable pollen in a plant, comprising introducing 
a nuclear fertility restorer nucleic acid into the plant, wherein the nuclear fertility 
restorer nucleic acid selected from the group consisting of: 

a) a nucleic acid comprising a gene within positions 88,073 and 198,041 of the 
nucleotide sequence of SEQ ID NO:87; 

b) a nucleic acid encoding a protein comprising an amino acid sequence encoded 
by a gene within positions 88,073 and 198,041 of the nucleotide sequence of SEQ ID 
NO:87; 

c) a nucleic acid comprising a nucleotide sequence differing from the sequence 
of the nucleic acids of a) or b) due to the degeneracy of the genetic code; 

d) a nucleic acid fragment of any of a) through c) above at least 150 nucleotides 
in length; and 

e) a nucleic acid having at least 70% homology with a gene within positions 
88,073 and 198,041 of the nucleotide sequence of SEQ ID NO:87, wherein sequence 
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homology is determined by the Karlin and Altschul algorithm using standard 
parameters. 



40. The method of Claim 39, wherein the plant comprises a cytoplasmic male-sterility 
determinant. 

41. The method of Claim 40, wherein the plant comprises an ogu cytoplasmic male 
sterility detenninant. 

42. The method of Claim 39, wherein the plant is a Brassica napus plant. 

43. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence 
selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34, SEQ ID NO:42, SEQ 
ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89. 

44. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence of 
SEQDDNO:32. 

45. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence of 
SEQIDNO:89. 

46. The method of Claim 39, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence selected from SEQ ID NO:29, SEQ ID NO:31, SEQ ID NO:33, 
SEQ ID NO:41, SEQ ID NO:43, SEQ ID NO:51, SEQ ID NO:53 and SEQ ID NO:88. 

47. The method of Claim 39, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:31. 

48. The method of Claim 39, wherein the nucleic acid encodes a protein comprising an 
amino acid sequence of SEQ ID NO:88. 

49. The method of Claim 39, wherein the nucleic acid comprises a nucleotide sequence 
differing from the sequence of the nucleic acids of a) or b) of Claim 1 due to the 
degeneracy of the genetic code. 
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50. The method of Claim 39, wherein the nucleic acid is at least 150 nucleotides in length 
and is a fragment of any of the nucleic acids of a) through c) of Claim 1. 



51. The method of Claim 39, wherein the nucleic acid has at least 70% homology with a 
nucleotide sequence selected from SEQ ID NO:30, SEQ ID NO:32, SEQ ID NO:34 5 
SEQ ID NO:42, SEQ ID NO:44, SEQ ID NO:52, SEQ ID NO:54 and SEQ ID NO:89 
and wherein sequence homology is determined by the Karlin and Altschul algorithm 
using standard parameters. 
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Fig. 9 Gene 16 cDNA and Protein 
10 20 30 40 50 60 70 

....|....|.-..|..-.|....|..,.|....|....|....|....|....|....|....|....| 
atgaggattgatgtttctgagccagagctatgcggttgcgacacttgtgtccagcatcggactttcatta 
MRIDVSEPELCGCDTCVQHRTFI 
5 10 15 20 

80 90 100 110 120 130 140 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

ctcaagaaaccgagccgagcaaagaagtgattggctcatcggttcctgttagttccgaaccagttcaacc 
TQETEPSKEVIGSSVPVSSEPVQP 
25 30 35 40 45 

150 160 170 180 190 200 210 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

tcttggttccacctcagatgagagttcaggaacagagacgactccactcgctcctcctccagtcaccaca 
LGSTSDESSGTETTPLAPPPVTT 
50 55 60 65 70 

220 230 240 250 260 270 280 

....|....|....|-..-|....|....|....|-...|.-..|....|...,|....|....|....| 
ccggttaataatcctgaaccagcagcgcagtctgttggctcaaccatcccacctgctgttacaccagtta 
PVNNPEPAAQSVGSTIPPAVTP V 
75 80 85 90 

290 300 310 320 330 340 350 

— I — i — | — | — | — | — | — | — | — | — | — | — | — | 

gttccgaacaaccagcacaagctcttggttccacctcggat^aaagttccggtacagagaccactccact 
SSEQPAQALGSTSDQSSGTETTPL 
95 100 105 110 115 

360 370 380 390 400 410 420 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

cgctcctcctatcaccacgtcggttaagtctgttgactcgaccatcttcttcaagttcccaccggtacaa 
APPITTSVKSVDSTIFFKFPPVQ 
120 125 130 135 140 

430 440 450 460 470 480 490 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

gcacaagctcttgcccctactgcttccggttcaacgcaagcccctgcttttggttttggtgcattcgctg 
AQALA PTASGSTQAPAFGFGAFA 
145 150 155 160 

500 510 520 530 540 550 560 

....|....|....|....|....| — 

ctcgcgtaccatctgccacctccggttgttcagcatttagtttcgcccctcctgttacatcggcaccagt 
ARVPSATSGCSAFSFAPPVTSAP V 
165 170 175 180 185 

570 580 590 600 610 620 630 

— 1....| — | — | — | — | — | — | — | — | — | — | — | — | 

gcaagctctaggcacaaccactactactactactactacatccgcggccgctcctgcatctccatttcac 
QALGTTTTTTTTTSAAAPASPFH 
190 195 200 205 210 

640 650 660 670 680 690 700 

....|....|....|.. ..|....|....|.... |....|....|.. ..|.... |.... |. | 

agttcctcaccaaccacattccaattccctcctgcttttacatcccttgctgcttctacttttccttctg 
SSSPTTFQFPPAFTSLAASTFPS 
215 220 225 230 

710 720 730 740 750 760 770 

ttgcatcatcaacttcatctccacttgatgctcctccctcaccatttagatggggatcactgcaagctaa 
VA SStSSPLDAPPSPFRWGSLQAN 
235 240 245 250 255 
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Fig. 9 Continued 
780 790 800 810 820 830 840 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

cacttccccaccctttagcttcttgccagcgcaaggttctgacaagactggttctgcttttactccaccg 
TSPPFSPLPAQGSDKTGSAFTPP 
260 265 270 275 280 

850 860 870 880 890 900 910 

....|....|.-..|....|....|....|....|....|....|....|.-..|....|....|....| 
tttggctaccctggtggttttgccagacctgatgttggtgtctctcatccagggtttggtccctctaacc 
FGYPGGFARPDVGVSHPGFGPSN 
285 290 295 300 

920 930 940 950 960 970 980 

....|....|....|....|....|....|.-..|..-.|....|....|.-..|....|..:.|....| 

attttggaccaaacgcaccaactactacacctgttcctgttcgcagtccatttttggctggtggtggaac 
HFGPNAP TTTPVPVRS PFLAGGGT 
305 310 315 320 325 

990 1000 1010 1020 1030 1040 1050 

tgaacaaggtagtaggtatcctcgttattcacctacaccagatgttgacggcaggctgataatgtccata 
EQGSRYPRYSPTPDVDGRLIMSI 
330 335 340 345 350 

1060 1070 1080 1090 1100 1110 1120 

: ...|....|....|....|....|....|....|....|....|....|....|....|....|....| 

tctgcttccaactcacatggacataaaagtcatgaagaattgaggtgggaagattacaaaaatggagaca 
SASNSHGHKSHEELRWEDYKNGD 
355 360 365 370 

1130 1140 1150 1160 1170 1180 1190 

aaggtgggtttgggtggtttcctcctgttcatacatctcccttttcctcaccaacggtatcaccgtcgct 
KGGFGWFPPVHTSPFSSPTVSPSb 
375 380 385 390 395 

1200 1210 1220 1230 1240 1250 1260 

— I — 1....| — | — | — 1....| — | — | — | — | — | — | — | 

atttgctcctccaagcatacctaatcgtcctcagatgagaactattgatctaacgaaccgagacatgtgt 
FAPPS I PNRPQMRTIDLTNRDM C 
400 405 410 415 420 

1270 1280 1290 1300 1310 1320 1330 

|....|....| |....|....|....|....| |....|....|....| | | 

ggttttcctattggctacaacacccccgctgctttccagagaccccctgaacccgctggtgtttcttccc 
GFPIGYNTPAAFQRPPEPAGVSS 
425 430 435 440 

1340 1350 1360 1370 1380 1390 1400 

cagcatctggatgcacagcgtgtggagccacgagtaggtcctctccttctagtcacttgggcttgaacaa 
PASGCTACGATSRSSPSSHLGLNN 
445 450 455 460 465 

1410 1420 1430 1440 1450 1460 1470 

.... i.. .. |. ... |.... |.... |.. i. ...|... .|....|.... |....|....| ....|..;.| 

taccacaaatcctccatcagctgcgacatctcttcccgggatgttcttttctacctatggttcttgtcct 
TTNPPSAATSLPGMFFSTYGSCP 
470 475 480 485 490 

1480 1490 1500 1510 1520 1530 1540 

ttgctgtttggctcaccaaatcttgcaacttatggtacaacagcaattccagcagtccaagcctatgcta 
LLFGS pnlatygttai pavqay a 

495 500 505 510 
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Fig. 9 Continued 

1550 1560 1570 1580 1590 1600 1610 

....|....|.... |....|.... |....|....|....|....|....|....|....|....|.... | 

ttatgtttggggctccaaattttacttctcaaggtacaacggcaactccagcttttcaagcctttcctat 
IMFGAPNFTSQGTTATPAFQAFPI 
515 520 525 530 535 

1620 1630 1640 1650 1660 1670 1680 

tatgtttgggactccaaatcttgctgctcaaggtactacaagagctccagctgttcaagcctatcctacg 
MFGTPNLAAQGTTRAPAVQAYPT 
540 545 550 555 560 

1690 1700 1710 1720 1730 1740 1750 

....|....|....|....|....|....|,...|....|....|....|....|....|....|....| 
atgtttggcacgccaaatattggagttcaaggttcaactccagcagctcaaacctatcctttgatgtttg 
MFGTPNIGVQGSTPAAQTYPLMF 
565 570 575 580 

1760 1770 1780 1790 1800 1810 1820 

....|....|....|....|....|....|....|....|....|....|,...|....|....|....| 
gcaccccaaatcttgctgctcaaggtacaacaaatattggagctcgaggtacaactccagcagctcaagc 
GTPNLAAQGTTNIGARGTTPAAQA 
585 590 595 600 605 

1830 1840 1850 1860 1870 1880 1890 

ctatccgttgatgtttggcaccccaaatcttgctgctcaaggtacaacaactccagcagttcagtcctat 
YPLMFGTPNLAAQGTTTPAVQSY 
610 615 620 625 630 

1900 1910 1920 1930 1940 1950 1960 

....|.....|..-.|....|....|....|....|....|....|....|....|....|....|....| 
cctacgatgtttggaacaccaaatctagctggtcaaagtacaacaacaactcgagcaggtcagccatatc 
PTMFGTPNLAGQSTTTTRAGQP Y 
635 640 645 650 

1970 1980 1990 2000 2010 2020 2030 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

ctacgacgtttgctgttcctcaagctgcgacagctccagcagttcagccgtatgctatgatgtttggtac 
PTTFAVPQAATAPAVQPYAMMFGT 
655 660 665 670 675 

2040 2050 2060 2070 2080 2090 2100 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

accaagtctcggagctcaagatatcactccaggaggtcaagcctatcccgctcatggtttaactctccca 
PSLGAQDITPGGQAYPAHGLTLP 
680 685 690 695 700 

2110 2120 
....|....|....|....|.... 

ttcgccgccatgagtctgcagtaa 2124 (SEQ ID NO: 32) 
FAAMSLQ* (SEQ ID NO: 31) 

705 
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Fig. 13 Gene 26 cDNA and Peptide 
10 20 30 40 50 60 70 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 
atgttggctagggtttgtggattcaagtgttcttcttctcctgctgagtctgcggctagattgttctgta 
MLARVCGFKCSSSPAESAARLFC 
5 10 15 20 

80 90 100 110 120 130 140 

....|....|.^.|....|....|....|....|....|....|....|....|....|.,..|....| 
cgagatcgattcgtgatactctggccaaggcaagcggagagagttgcgaagcaggttttggaggagagag 
TRS IRDTLAKASGESCEAGFGGES 
25 30 35 40 45 

150 160 170 180 190 200 210 

| | | |....| | |....| | | |....| | | 

tttgaagctgcaaagtgggtttcatgaaatcaaaggtttagaggatgcgattgatttgttcagtgacatg 
LKLQSGFHEI KGLEDAIDLFSDM 
50 55 60 65 70 

220 230 240 250 260 270 280 

.... |.. J. ...|....|... .|....|....|....|....|....|....|....|.... | 
cttcgatctcgtcctttaccttctgtggttgatttctgtaaattgatgggtgtggtggtgagaatggaac 
LRSRPLPSVVDFCKLMGVVVRME 
75 80 85 90 

290 300 310 320 330 340 350 

....|....|....|....|-..-|....|....|....|....|...,|....|....|....|....| 
gcccggatcttgtgatttctctctatcagaagatggaaaggaaacagattcgatgtgatatatacagctt 
RPDLVI SLYQKMERKQIRCDIYSF 
95. 100 105 110 115 

360 370 380 390 400 410 420 

....|....|....|....|....|....|....|....|....|....|....|....|,...|....| 

caatattctgataaaatgtttctgcagctgctctaagctcccctttgctttgtctacatttggtaagatc 
NILIKCFCSCSKLPFALSTFGKI 
120 125 130 135 140 

430 440 450 460 470 480 490 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

accaagcttggactccaccctgatgttgttaccttcaccaccctgctccatggattatgtgtggaagata 
TKLGLHPDVVTFTTLLHGLCVED 
145 150 155 160 

500 510 520 530 540 550 560 

....|....|....|....|....|....|....[....|....|....|....|....|....|....| 

gggtttctgaagccttggatttttttcatcaaatgtttgaaacgacatgtaggcccaatgtcgtaacctt 
RVSEALDFFHQMFETTCRPNVVTF 
165 170 175 180 185 

570 580 590 600 610 620 630 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

caccactttgatgaacggtctttgccgcgagggtagaattgtcgaagccgtagctctgcttgatcggatg 
TTLMNGLCREGRIVEAVALLDR M 
190 195 200 205 210 

640 650 660 670 680 690 700 



atggaagatggtctccagcctacccagattacttatggaacaatcgtagatgggatgtgtaagaagggag 
MEDGLQPTQITYGTIVDGMCKKG~ 
215 220 225 230 

710 720 730 740 750 760 770 



atactgtgtctgcactgaatctgctgaggaagatggaggaggtgagccacatcatacccaatgttgtaat 
DTVSALNLLRKMEEVSHI IPNVVI 
23 5 240 245 250 255 
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Fig. 13 Continued 
780 790 800 810 820 830 840 

....|....|....|....|....|....|....|....|....|...-|....|....|....|....| 
ctatagtgcaatcattgatagcctttgtaaagacggacgtcatagcgatgcacaaaatcttttcactgaa 
YSAIIDSLCKDGRHSDAQNLFTE 
260 265 270 275 280 

850 860 870 880 890 900 910 

....|....|....| ? .,.|....|....|....|....|....|....|....|....|....|....| 
atgcaagagaaaggaatctttcccgatttatttacctacaacagtatgatagttggtttttgtagctctg 
MQEKGIFPDLFTYNSMIVGFCSS 
285 290 295 300 

920 930 940 950 960 970 980 

|....|....| |....|....|....|....|....| |....|....|....|....| 

gtagatggagcgacgcggagcagttgttgcaagaaatgttagaaaggaagatcagccctgatgttgtaac 
GRWSDAEQLLQEMLERKI SPDVVT 
305 310 315 320 325 

990 1000 1010 1020 1030 1040 1050 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 
ttataatgctttgatcaatgcatttgtcaaggaaggcaagttctttgaggctgaagaattatacgatgag 
YNALINAFVKEGKFFEAEELYDE 
330 335 340 345 350 

1060 1070 1080 1090 1100 1110 1120 

..•.|,...|....|....|....|....|....|....|....|....|....|....|....|....| 
atgcttccaaggggtataatccctaatacaatcacatatagttcaatgatcgatggattttgcaaacaga 
MLPRGI I PNTITYSSMIDGFCKQ 
355 360 365 370 

1130 1140 1150 1160 1170 1180 1190 

....|....|....|....|.-..|....|....|....|....|....|....|.,..|....|....| 
atcgtcttgatgctgctgagcacatgttttatttgatggctaccaagggctgctctcccaacctaatcac 
NRLDAAE HM FYLMATKGC SPNL I T 
375 380 385 390 395 

1200 1210 1220 1230 1240 1250 1260 

....|.-...|....|....|.-..|....|....|....|....|....|....|.. ..|....|.... | 
tttcaatactctcatagacggatattgtggggctaagaggatagatgatggaatggaacttctccatgag 
FNTLIDGYCGAKRIDDGMELLHE 
400 405 410 415 420 

1270 1280 1290 1300 1310 1320 1330 

atgactgaaacaggattagttgctgacacaactacttacaacactcttattcacgggttctatctggtgg 
MT ETGLVADTTTYNTL IHGFYLV*" 
425 430 435 440 

1340 1350 1360 1370 1380 1390 1400 

|....|....| |....| | |....| | ) | | | | 

gcgatcttaatgctgctctagaccttttacaagagatgatctctagtggtttgtgccctgatatcgttac 
GDLNAALDLLQEMISSGLCPD IVT 
445 450 455 460 465 

1410 1420 1430 1440 1450 1460 1470 



ttgtgacactttgctggatggtctctgcgataatgggaaactaaaagatgcattggaaatgtttaaggtt 
CDTLLDGL CDNGKLKDALEMFKV 
470 475 480 485 490 

1480 1490 1500 1510 1520 1530 1540 



atgcagaagagtaagaaggatcttgatgctagtcaccccttcaatggtgtggaacctgatgttcaaactt 
MQKSKKDLDASHPFNGVEPDVQT 
495 500 505 510 
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Fig- 13 Continued 
1550 1560 1570 1580 1590 1600 1610 

....|....|....|....|....|....|....|....|....|....|....|....|....|....| 

acaatatattgatcagcggcttgatcaatgaagggaagtttttagaggccgaggaattatacgaggagat 
YNI LI SGLINEGKFLEAEEIiYEEM 
515 520 525 530 535 

1620 1630 1640 1650 1660 1670 1680 

...•|....|....|....|....|....|....|.-..|....|.--.|....|....|....|....| 
gccccacaggggtatagtcccagatactatcacctatagctcaatgatcgatggattatgcaagcagagc 
PHRGIVPDTITYS SMIDGLCKQS 

540 545 550 555 560 

1690 1700 1710 1720 1730 1740 1750 

•...|....|....|....|..-.|....|....|....|....|....|-...|.-..|....|....| 
cgcctagatgaggctacacaaatgtttgattcgatgggtagcaagagcttctctccaaacgtagtgacct 
RLDEATQMFDSMGSKS FSPNVVT 
565 570 575 580 

1760 1770 1780 1790 1800 1810 1820 

....|....|....|....|....|....|....|....|....|....|....|....|-...|....| 
ttactacactcattaatggctactgtaaggcaggaagggttgatgatgggctggagcttttctgcgagat 
FTTLINGYCKAGRVDDGLELFCEM 
585 590 595 600 605 

1830 1840 1850 1860 1870 1880 1890 

gggtcgaagagggatagttgctaacgcaattacttacatcactttgatttgtggttttcgtaaagtgggt 
GRRGIVANAITYITLICGFRKV G 
610 615 620 625 630 

1900 1910 1920 1930 1940 1950 1960 

| | ) |....|....|....|....|....| | | |....| | 

aatattaatggggctctagacattttccaggagatgatttcaagtggtgtgtatcctgataccattacca 
NINGALDI FQEMISSGVYPDTIT 
635 640 645 650 

1970 1980 1990 2000 2010 2020 2030 

....|....|..,.|....|....|....|....|....|....|....|....|....|..,.|..,.| 
tccgcaatatgctgactggtttatggagtaaagaggaactaaaaagggcagtggcaatgcttgagaaact 
IRNMLTGIiWSKEELKRAVAMLEKL 
655 660 665 670 675 

2040 2050 2060 2070 2080 2090 2100 

....|....|..,.|....|....|....|....|....|....|....|....|....|..,.|....| 
gcagatgagtatggtatgtaagtttctgttcagtctatgtattttttatataaacaagaatgtatacatt 
QMSMVCKFLFSLCIFYINKNVYI 
680 685 690 695 700 

2110 2120 



cttttgtgtgtagcttcagattga 2124 (SEQ ID NO: 89) 
LLCVASD* (SEQ ID NO: 88) 

705 
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